alerting on service status summary
Posted: Wed Oct 29, 2014 9:25 am
Hi;
Is there a way to monitor on the overall service status summary? i.e. When I can, I configure any service timeouts to a status of unknown, which I don't alert on. Instead I would like to send a single an alert when the total number of unknowns exceed a certain threshold ? This would save me from flooding pagers for what may be considered a monitoring failure or a system wide outage of some sort. I'm sure it can be debated what constitutes an alert, but in our environment people are sensitive to receiving pages for something that isn't an actual threshold exception, the host check is considered the availability monitor so any other timeout alerts aren't really welcome.
Is there a way to monitor on the overall service status summary? i.e. When I can, I configure any service timeouts to a status of unknown, which I don't alert on. Instead I would like to send a single an alert when the total number of unknowns exceed a certain threshold ? This would save me from flooding pagers for what may be considered a monitoring failure or a system wide outage of some sort. I'm sure it can be debated what constitutes an alert, but in our environment people are sensitive to receiving pages for something that isn't an actual threshold exception, the host check is considered the availability monitor so any other timeout alerts aren't really welcome.