
alerting on service status summary

Posted: Wed Oct 29, 2014 9:25 am
by paul.jobb
Hi;

Is there a way to monitor the overall service status summary? That is, where I can, I configure service timeouts to return a status of unknown, which I don't alert on. Instead, I would like to send a single alert when the total number of unknowns exceeds a certain threshold. This would save me from flooding pagers for what may be a monitoring failure or a system-wide outage of some sort. I'm sure it can be debated what constitutes an alert, but in our environment people are sensitive to receiving pages for something that isn't an actual threshold exception. The host check is considered the availability monitor, so any other timeout alerts aren't really welcome.
[Attachment: Capture3.PNG]

Re: alerting on service status summary

Posted: Wed Oct 29, 2014 1:42 pm
by slansing
Well, you could do this to some extent by using service dependencies or BPI. I would take a look at this:

http://nagios.sourceforge.net/docs/nagi ... tions.html

BPI is already installed by default, and the later portion of this document covers it pretty well from a top-level perspective:

http://assets.nagios.com/downloads/nagi ... _Addon.pdf

Re: alerting on service status summary

Posted: Wed Oct 29, 2014 3:03 pm
by paul.jobb
Thanks. I do use BPI and service dependencies for other processes; it's just that they require some configuration. I will look at them again and see if that's what I want. I might also choose to use the MySQL plugin and just query for the count of unknowns and alert when they are high (meaning, to me, that there is something going on with monitoring connectivity). You can close this off. Thanks.
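
For illustration only, here is a minimal sketch of the "count the unknowns and alert on the total" idea. It is not the MySQL plugin approach mentioned above: it assumes the service states are read from the text of a Nagios status file (blocks of the form `servicestatus { ... current_state=3 ... }`), and the function names `count_unknown_services` and `evaluate` and the warning/critical thresholds are hypothetical. The return codes follow the usual plugin convention (0 = OK, 1 = WARNING, 2 = CRITICAL).

```python
UNKNOWN_STATE = 3  # Nagios service state code for UNKNOWN


def count_unknown_services(status_text):
    """Count servicestatus blocks whose current_state is UNKNOWN.

    Assumes status_text is the contents of a Nagios status file.
    Host blocks (hostststatus-style entries) are ignored on purpose.
    """
    count = 0
    in_service = False
    for raw in status_text.splitlines():
        line = raw.strip()
        if line.startswith("servicestatus") and line.endswith("{"):
            in_service = True          # entering a service block
        elif line == "}":
            in_service = False         # leaving any block
        elif in_service and line == f"current_state={UNKNOWN_STATE}":
            count += 1
    return count


def evaluate(unknown_count, warn, crit):
    """Map the count to a plugin exit code and message.

    Alerts only when the count strictly exceeds a threshold
    (hypothetical thresholds supplied by the caller).
    """
    if unknown_count > crit:
        return 2, f"CRITICAL - {unknown_count} services in UNKNOWN state"
    if unknown_count > warn:
        return 1, f"WARNING - {unknown_count} services in UNKNOWN state"
    return 0, f"OK - {unknown_count} services in UNKNOWN state"
```

To use it as a check, a wrapper script would read the status file from wherever your install writes it, print the message, and exit with the returned code, so that one service carries the single page instead of every timed-out check.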

Re: alerting on service status summary

Posted: Wed Oct 29, 2014 3:10 pm
by cmerchant
I will close for now. Thanks.