diff options
author | Austin S. Hemmelgarn <ahferroin7@gmail.com> | 2019-11-15 13:14:43 -0500 |
---|---|---|
committer | Ilya Mashchenko <ilya@netdata.cloud> | 2019-11-15 21:14:43 +0300 |
commit | ef721290f03ec9d0292f062672feaad78d7fe3a9 (patch) | |
tree | 3a8e57a0117bfe166317dfcdd407ee861eb90ef1 /health/health.d/ram.conf | |
parent | ac90f27a580aad0c4d20c5354657b34e5ceed29e (diff) |
Fine tune various alarm values. (#7322)
* Fix formatting in alarm configurations.
This makes sure everything is lined up properly so that the alarm
definitions are easier to read.
* Make TCP Accept Queue alarms much less aggressive.
This switches the alarms to use averages instead of sums, and bumps up
the trip points to be more aggressive, as both of these may be non-zero
even in normal operation of a system.
* Make softnet alarms less aggressive.
This decreases the sampling window from 10 minutes to 1 minute, switches
to using an average instead of a sum, and adjusts the trigger thresholds
to be more aggressive.
This one will need to be watched, as the resultant values may be too
lenient for some systems.
* Tweak UDP alarms to work like the TCP alarms.
Just to ensure consistency.
Diffstat (limited to 'health/health.d/ram.conf')
-rw-r--r-- | health/health.d/ram.conf | 48 |
1 files changed, 24 insertions, 24 deletions
diff --git a/health/health.d/ram.conf b/health/health.d/ram.conf index 4e41bb496d..15e8e84640 100644 --- a/health/health.d/ram.conf +++ b/health/health.d/ram.conf @@ -37,28 +37,28 @@ to: sysadmin ## FreeBSD -alarm: ram_in_use - on: system.ram - os: freebsd -hosts: * - calc: ($active + $wired + $laundry + $buffers - $used_ram_to_ignore) * 100 / ($active + $wired + $laundry + $buffers - $used_ram_to_ignore + $cache + $free + $inactive) -units: % -every: 10s - warn: $this > (($status >= $WARNING) ? (80) : (90)) - crit: $this > (($status == $CRITICAL) ? (90) : (98)) -delay: down 15m multiplier 1.5 max 1h - info: system RAM usage - to: sysadmin + alarm: ram_in_use + on: system.ram + os: freebsd + hosts: * + calc: ($active + $wired + $laundry + $buffers - $used_ram_to_ignore) * 100 / ($active + $wired + $laundry + $buffers - $used_ram_to_ignore + $cache + $free + $inactive) + units: % + every: 10s + warn: $this > (($status >= $WARNING) ? (80) : (90)) + crit: $this > (($status == $CRITICAL) ? (90) : (98)) + delay: down 15m multiplier 1.5 max 1h + info: system RAM usage + to: sysadmin - alarm: ram_available - on: system.ram - os: freebsd - hosts: * - calc: ($free + $inactive + $used_ram_to_ignore) * 100 / ($free + $active + $inactive + $wired + $cache + $laundry + $buffers) - units: % - every: 10s - warn: $this < (($status >= $WARNING) ? (15) : (10)) - crit: $this < (($status == $CRITICAL) ? (10) : ( 5)) - delay: down 15m multiplier 1.5 max 1h - info: estimated amount of RAM available for userspace processes, without causing swapping - to: sysadmin + alarm: ram_available + on: system.ram + os: freebsd + hosts: * + calc: ($free + $inactive + $used_ram_to_ignore) * 100 / ($free + $active + $inactive + $wired + $cache + $laundry + $buffers) + units: % + every: 10s + warn: $this < (($status >= $WARNING) ? (15) : (10)) + crit: $this < (($status == $CRITICAL) ? (10) : ( 5)) + delay: down 15m multiplier 1.5 max 1h + info: estimated amount of RAM available for userspace processes, without causing swapping + to: sysadmin |