summaryrefslogtreecommitdiffstats
path: root/health
AgeCommit message (Collapse)Author
2022-05-03Check for chart obsoletion on children re-connections (#12707)Emmanuel Vasilakis
* check for chart obsoletion on children connections * use rrdset_is_obsolete
2022-05-02Reduce alert events sent to the cloud. (#12544)Emmanuel Vasilakis
* filter * update filter * queue removed directly * more * logging * cleanup * cleanup 2 * cleanup 3 * finalize instead of reset
2022-04-25Tag Gotify health notifications for the Gotify phone app (#12753)JaphethLim
2022-04-20Store alert log entries even if alert is repeating. (#12226)Emmanuel Vasilakis
* store alarm log entries even if it is repeating * log times repeated for an alert
2022-04-20Gotify notifications (#12639)coffeegrind123
2022-04-11Add a timeout parameter to data queries (#12649)Stelios Fragkakis
* Add timeout parameter in queries and in calling functions * Add CANCEL flag in RRDR and code to cancel a query * Update swagger * Format swagger file properly
2022-04-01Fix memory leaks on Netdata exit (#12511)Vladimir Kobal
* Fix memory leaks in dimensions and charts * Initialize superblock memory regions * Clean up static threads * Fix memory leaks in compression * Fix memory leaks in rrdcaltemplate * Fix memory leaks in health config * Fix ACLK memory leaks
2022-03-31Initialize foreach alarms of dimensions in health thread. (#12452)vkalintiris
The previous approach required us to try wr-lock the host after locking a chart and sleeping on failure. Lock contention would lead to alarms not being created and the agent to become unresponsive.
2022-03-23Adjust timex.plugin information to be less cryptic (#12495)DanTheMediocre
Co-authored-by: Ilya Mashchenko <ilya@netdata.cloud> Co-authored-by: Tina Luedtke <kickoke@users.noreply.github.com>
2022-03-18fix(collectors/cgroups): use different context for cgroup network charts ↵Ilya Mashchenko
(#12454)
2022-03-15Add additional link to badges doc (#12412)Steve8291
Make it easier to find the badges link when looking for OPTIONS.
2022-03-15Remove backends subsystem (#12146)Vladimir Kobal
2022-03-14Delay removed event for 60 seconds after the chart's last collected time ↵Emmanuel Vasilakis
(#12388) * delay removed event for 60 seconds * change check
2022-03-11Only store alert hashes once per health config iteration (#12292)Emmanuel Vasilakis
* only store alert hashes when iterated from localhost * store hashes on start and health reload, at least for one pass of a host
2022-03-09fix(health): make ioping_disk_latency alarm less sensitive (#12351)Ilya Mashchenko
2022-03-07Add latency dimension (#12329)Steve8291
2022-03-04fix(health): adjust 10s_ipv4_tcp_resets_sent warn trigger (#12320)Ilya Mashchenko
2022-03-02feat(health): add charts.d/nut alarms (#12285)Ilya Mashchenko
2022-02-22Update AWS SNS README.md (#11946)Tina Luedtke
* Update README.md I mainly made changes to the structure and the copy to improve legibility. However, I still need some input for these sections: ### line 18: Q: How is Netdata better than SNS? ### line 31: - While not mandatory, it is highly recommended to create a dedicated IAM user on your account for Netdata to send notifications. This user needs to have programmatic access, and should only allow access to SNS. If you're really paranoid, you can create one for each system or group of systems. Q: Can someone clarify what this means and how we can say the same thing without calling our users paranoid? ### line 43 * Q: I need to know which config file the user needs to change. * Q: I need clarification on this sentence: This variable supports all the same variables you can use in custom notifications. I've also found a [configuration script of a community member](http://pkg.cs.ovgu.de/LNF/linux/ubuntu/18.04/LNFnetdata/reloc/share/netdata/conf.d/health_alarm_notify.conf) containing an AWS SNS section. Is the information relevant to only this community member or a wider audience? Can we possibly use some of it as an example? ------------------------------------------------------------------------------ # Amazon SNS notifications # # This method requires potentially complex manual configuration. See the # netdata wiki for information on what is needed. # enable/disable sending Amazon SNS notifications SEND_AWSSNS="YES" # Specify a template for the Amazon SNS notifications. This supports # the same set of variables that are usable in the `custom_sender()` # function in the custom notification configuration below. # AWSSNS_MESSAGE_FORMAT="${status} on ${host} at ${date}: ${chart} ${value_string}" # If a role's recipients are not configured, use the following. # (empty = do not send a notification for unconfigured roles) # # Recipients for AWS SNS notifications are specified as topic ARN's. # DEFAULT_RECIPIENT_AWSSNS="" Thank you! PS.: I kept Google analytics, as we will sweep out these links in a different initiative. * Applied comments from code review * Removed Typo
2022-02-22Skip info field in protobuf alerts messages if it doesn't exist. (#12210)Emmanuel Vasilakis
* dont assume info field exists * add info field to documentation
2022-02-22remove deprecated node.d modules (#12047)Ilya Mashchenko
2022-02-21Create a removed alert event if chart is obsolete (#12021)Emmanuel Vasilakis
2022-02-17Fix alerts to raise correctly when the delay and repeat parameters are used ↵Erdem Ergen
together (#12164)
2022-02-17Docs: Removed Google Analytics tags (#12145)Tina Luedtke
2022-02-02Updated formatting issues and copy (#11944)Tina Luedtke
Co-authored-by: Tina Luedtke <kickoke@users.noreply.github.com> Co-authored-by: Ilya Mashchenko <ilya@netdata.cloud>
2022-01-28fix(docs): unresolved file references (#12053)Ilya Mashchenko
2022-01-19Fix queue removed alerts (#11996)Emmanuel Vasilakis
* delay queueing removed alerts * parenthesis * remove debug
2022-01-19Compute platform-specific list of static_threads at runtime. (#11955)vkalintiris
Compute array of static threads at runtime.
2022-01-17Add localhost hostname to the edit_command (#11793)Emmanuel Vasilakis
* include localhost hostname in edit_command * since the edit_command now contains the localhost name, dont pass it again to the script
2022-01-11Fix time_t format (#11897)Vladimir Kobal
2022-01-11Allow PushBullet notifications to be sent to PushBullet channels (#11850)sourcecodes2
Co-authored-by: Tina Luedtke <kickoke@users.noreply.github.com> Co-authored-by: Ilya Mashchenko <ilya@netdata.cloud>
2021-12-23chore: update community link of alert notifications (#11860)Marilena Ioannou
2021-12-22Improve PagerDuty notification doc (#11147)Joel Hans
Co-authored-by: ilyam8 <ilya@netdata.cloud>
2021-12-20Added "==" to the list of expression operators (#11905)David Lane
2021-12-20fix(docs): unresolved file references (#11903)Ilya Mashchenko
2021-12-09add z score alarm example (#11871)Andrew Maguire
* add z score alarm examples * give examples header titles
2021-12-07fix(health): used_swap alarm calc (#11868)Ilya Mashchenko
2021-11-30fix alarms info (#11844)Ilya Mashchenko
2021-11-29Remove pihole_blocked_queries alert (#11829)Fotis Voutsas
2021-11-18Always queue alerts to aclk_alert (#11806)Emmanuel Vasilakis
* always queue to aclk_alert * proper function name
2021-11-16Fix typos (#11782)Dimitris Apostolou
Co-authored-by: ilyam8 <ilya@netdata.cloud>
2021-10-25Queue removed alerts to cloud for new architecture (#11704)Emmanuel Vasilakis
* rebased * add error message * make function void * fix return
2021-10-22Fix coverity issues 373612 & 373611 (#11684)Emmanuel Vasilakis
* fix 2 coverity errors * remove call to sql_queue_removed_alerts_to_aclk from health
2021-10-19Add snapshot message and calls to sql_queue_removed_alerts_to_aclk (#11664)Emmanuel Vasilakis
2021-10-18fix swap_used alarm calc (#11672)Ilya Mashchenko
2021-10-18Revert "Update alarms info" (#11633)Ilya Mashchenko
2021-10-06Enable additional functionality for the new cloud architecture (#11579)Stelios Fragkakis
2021-09-27Update alarms info (#11481)Ilya Mashchenko
* update apcupsd alarm's info
2021-09-23Add alert message support for ACLK new architecture (#11552)Emmanuel Vasilakis
* add alert messages * also clear date_cloud_ack * move buffer_create * remove include file * use wc->node_id
2021-09-21fix coverity (#11535)Emmanuel Vasilakis