Page 1 of 1

Service Alerting

Posted: Mon Dec 15, 2014 8:53 am
by jstoddart
Hi,

We are trying to use a combination of Check Interval, Retry Interval and Max Check Attempts to stop certain services appearing on the Operations Center View as soon as the first check fails. As far as we understand from the available docs setting these 3 variables 2 5,1 and 5 respectively should mean the following:

Check runs every 5 mins, if its not Ok continues to check every 1 minute, if its still "not Ok" after 5 of these checks then alert.

What we see is the alert trigger straight away and if we view the Advanced tab it shows "1 of 5".

How do we adjust these settings so that alerts only appear after a defined number of failures?

Thanks

Jamie

Re: Service Alerting

Posted: Mon Dec 15, 2014 11:37 am
by abrist
What you are seeing is the SOFT state "alert" (really just a UI change), which will not send out notifications. It seems like you are looking for an option to only display HARD problem states, is that correct?

Re: Service Alerting

Posted: Mon Dec 15, 2014 11:39 am
by slansing
Well, I can see why you are having some issues here. The Operations screen, and in fact, any of those displays work off of state changes, not alerts/notifications. So no matter what you have set up for your check intervals, retries, notifications, etc, will not effect what is displayed there, only the true service/host state will, to build off of what Abrist mentioned.

Re: Service Alerting

Posted: Tue Dec 16, 2014 6:33 am
by jstoddart
Ok thanks,

So basically there is no way to stop these appearing on the Operations Center screen then?

Re: Service Alerting

Posted: Tue Dec 16, 2014 10:33 am
by tmcdonald
Correct. The screen is meant to show all problems so you can get them resolved faster than if you would wait for an alert, and maybe even sooner.

Re: Service Alerting

Posted: Wed Dec 17, 2014 4:54 am
by jstoddart
Ok, thanks

Re: Service Alerting

Posted: Wed Dec 17, 2014 10:48 am
by tmcdonald
All free to close this up?