Page 1 of 1

Nagios Alerting - Condition + Time

Posted: Wed Jan 22, 2020 7:56 am
by Eireocean
Good Day,

We have been tasked with monitoring AIX Servers, specifically related to Entitlements. They only want to be alerted if a servers entitlement, as an example, is above 95% and is sustained for longer 60 minutes.

Is or would the following be the most reliable way :

max_check_attempts 10
retry_interval 6


... or is there an alternative method ? I am trying to factor im what happens in between the intervals of 6 minutes.

Regards

Re: Nagios Alerting - Condition + Time

Posted: Wed Jan 22, 2020 3:32 pm
by mbellerue
That is probably the best way to do it. Regarding what happens between intervals, what I'm hearing is that the responding team may be unhappy if they investigate the server, and find that for 5 minutes out of 60 minutes, the system load was actually fine. If that is a problem, then you can always adjust the retry and max check attempt intervals. Nothing stops you from having max checks set to 60 and retry interval set to 1 minute.