Check Retries

Posted: Thu Jan 14, 2021 8:51 am
by hollanro
Newbie question; if I set a check to run every 5 minutes with a check interval of 1 minute and a maximum retry of 10 times, does that mean if the service is not UP, the first check that discovered the DOWN state is still running retries when the second scheduled check kicks off 5 minutes later?

Re: Check Retries

Posted: Fri Jan 15, 2021 11:39 am
by benjaminsmith
Hi @hollanro,

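There are two settings here: the check interval and the retry interval. I believe you have the check interval set to 5 minutes and the retry interval set to 1 minute. Once the host or service goes non-OK, the retry interval kicks in, so re-checks run every 1 minute (in this example). The regular 5-minute schedule doesn't run alongside the retries; Nagios schedules one next check at a time for each host or service, so the two never overlap.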

Here's Nagios Core Docs on these two settings:
check_interval: This directive is used to define the number of "time units" between regularly scheduled checks of the host. Unless you've changed the interval_length directive from the default value of 60, this number will mean minutes. More information on this value can be found in the check scheduling documentation.

retry_interval: This directive is used to define the number of "time units" to wait before scheduling a re-check of the hosts. Hosts are rescheduled at the retry interval when they have changed to a non-UP state. Once the host has been retried max_check_attempts times without a change in its status, it will revert to being scheduled at its "normal" rate as defined by the check_interval value. Unless you've changed the interval_length directive from the default value of 60, this number will mean minutes. More information on this value can be found in the check scheduling documentation.
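
To make the timing concrete, here is a rough Python sketch of the schedule those two directives would produce for your numbers (check_interval 5, retry_interval 1, max_check_attempts 10). It is only an illustration, not Nagios code: the function name check_times and the extra_normal_checks parameter are made up for this example, and it assumes the first failing check counts as attempt 1, the default interval_length of 60, and a service that stays down the whole time.

```python
def check_times(check_interval, retry_interval, max_check_attempts,
                extra_normal_checks=2):
    """Return (minute, attempt, state_type) tuples for a service that is
    found non-OK at minute 0 and never recovers.

    Assumption: the check that first sees the problem is attempt 1.
    """
    schedule = []
    minute = 0
    for attempt in range(1, max_check_attempts + 1):
        # The state stays SOFT until the last attempt, then becomes HARD.
        state_type = "HARD" if attempt == max_check_attempts else "SOFT"
        schedule.append((minute, attempt, state_type))
        if attempt < max_check_attempts:
            # While retrying, the next check is one retry_interval away.
            minute += retry_interval
    # After the final attempt, checks go back to the normal check_interval.
    for _ in range(extra_normal_checks):
        minute += check_interval
        schedule.append((minute, max_check_attempts, "HARD"))
    return schedule


timeline = check_times(check_interval=5, retry_interval=1,
                       max_check_attempts=10)
for minute, attempt, state_type in timeline:
    print(f"t+{minute:>2} min  attempt {attempt:>2}  {state_type}")
```

Under those assumptions the checks land at minutes 0 through 9 and then at 14 and 19. The check at minute 5 is just one of the retries; there is no separate 5-minute check stacked on top of it.
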
Take a look and let me know if you need clarification on anything. If you want to learn more about the logic for state changes, the following doc is really helpful (see the table at the bottom).

https://assets.nagios.com/downloads/nag ... types.html

Best Regards,
Benjamin