When a host enters a critical DOWN state, Nagios retries roughly every 60 seconds up to the max retry count, then sends a notification on the final HARD DOWN state. The problem is that my retry interval is set to 5 minutes. I can't understand why it's retrying every 60 seconds instead of honoring the retry interval specified in the host definition.
My hosts are configured with the following intervals:
check_interval=5
retry_interval=5
max_check_attempts=4
My main config contains interval_length=60.
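For reference, here is a sketch of how those settings might look in a host definition, with the timing they should produce. The host name and the `generic-host` template are hypothetical placeholders, not taken from the actual config. With interval_length=60, each interval unit is 60 seconds, so 5 units should mean 300 seconds between checks.

```
# Fragment only; host_name and the template are assumed for illustration.
define host {
    use                   generic-host    ; hypothetical template supplying the other required directives
    host_name             example-host    ; hypothetical name
    check_interval        5               ; 5 x interval_length (60 s) = 300 s between regular checks
    retry_interval        5               ; 5 x 60 s = 300 s between retries while the state is SOFT
    max_check_attempts    4               ; the state becomes HARD on the 4th failed check
}
```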
Could it be due to the fact that my check and retry intervals are the same? What could be causing it to retry every ~ 60 sec, instead of every 5 mins?
Thanks for any help you can provide.
-wr
retry_interval not honored
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: retry_interval not honored
Do you have any services on this host that have a lower retry_interval?
When a service check fails, the service triggers a forced recheck of the host to see if it is still down, so a service with a shorter retry_interval will cause the host to be rechecked more often than the host's own retry_interval.
Re: retry_interval not honored
That was the issue. There were service checks with retry intervals of 1 min attached to the host. Once I backed these off, the 5 min retry interval was honored. Thanks for the reply.
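A minimal sketch of the kind of change described, assuming a service attached to the same host. The host name, service description, check command, and `generic-service` template are hypothetical; only the retry_interval values (1 before, 5 after) come from this thread.

```
# Fragment only; names and the template are assumed for illustration.
define service {
    use                   generic-service ; hypothetical template supplying the other required directives
    host_name             example-host    ; hypothetical name
    service_description   PING            ; hypothetical service
    check_command         check_ping      ; hypothetical command
    retry_interval        5               ; was 1; raised to match the host's retry_interval
}
```

With the service retries no longer shorter than the host's, the forced host rechecks stop arriving every minute.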
-
scottwilkerson
Re: retry_interval not honored
warapp wrote: That was the issue. There were service checks with retry intervals of 1 min attached to the host. Once i backed these off, the 5min retry interval was honored. Thanks for the reply.
Great!
Locking thread