summaryrefslogtreecommitdiffstats
path: root/health
diff options
context:
space:
mode:
authorCosta Tsaousis <costa@netdata.cloud>2023-07-10 14:11:04 +0300
committerGitHub <noreply@github.com>2023-07-10 14:11:04 +0300
commiteb6f1de7c6f7b626cb2ec333697b63034115914b (patch)
treea28fe8d5530573771df1d71cae66b32216d64289 /health
parentad85ae35b941b1c6cae17d6ad07a8f2f3116cdda (diff)
multi-threaded version of freeipmi.plugin (#15327)
* multi-threaded version of freeipmi.plugin * fix type check * debug info * debug info * updated should be smaller, not bigger * ignore sensors without name * variable data collection frequencies for sensors and sel; also respect the min data collection frequency * reorg and code cleanup * collect states even for unknown units and empty names * render all sensors * reset unknown state sensors * ignore sensors without name * added component fan * Update ipmi.conf * added label type * remove global state counters and chart * updated copyright notice * remove unused struct members * remove unused variable * added a log line everytime the plugin decides to exit to show what was wrong * reworked freeipmi for optimal performance * disabled debugging and fixed bug * added debug * added debug * added debug * removed debugging info * cleanup and final touches * let fan metrics be categorized by the component they are cooling * added plugin and module to charts * more component matches * code cleanup, sel should now be a lot faster * make sel min collection time 30 secs * more component matches; refreshed functions copied from freeipmi codebase * add keepalive to avoid parser read timeout during ipmi_detect_speed_secs * ipmi.fan_speed => ipmi.sensor_fan_speed * update metrics csv and readme * ok newline --------- Co-authored-by: Ilya Mashchenko <ilya@netdata.cloud>
Diffstat (limited to 'health')
-rw-r--r--health/health.d/ipmi.conf10
1 files changed, 5 insertions, 5 deletions
diff --git a/health/health.d/ipmi.conf b/health/health.d/ipmi.conf
index feadba1b76..4d6478cca4 100644
--- a/health/health.d/ipmi.conf
+++ b/health/health.d/ipmi.conf
@@ -1,15 +1,15 @@
- alarm: ipmi_sensors_states
- on: ipmi.sensors_states
+ template: ipmi_sensor_state
+ on: ipmi.sensor_state
class: Errors
type: System
component: IPMI
calc: $warning + $critical
- units: sensors
+ units: state
every: 10s
- warn: $this > 0
+ warn: $warning > 0
crit: $critical > 0
delay: up 5m down 15m multiplier 1.5 max 1h
- info: number of IPMI sensors in non-nominal state
+ info: IPMI sensor ${label:sensor} (${label:component}) state
to: sysadmin
alarm: ipmi_events