
Host Rechecks faster than retry setting

Posted: Wed Nov 25, 2015 5:29 pm
by lytxnoc
We have an issue where, once a host goes into a soft down state, the retry check runs sooner than the host's retry setting allows. Our retry interval is set to 1 minute on all hosts, but when a host goes into a soft down state, the check is retried after anywhere from 29 to 39 seconds, when it should be retried after 1 minute.

Here is a sampling of the soft down checks.
2015-11-24 18:53:28 hqs1v-dl3com02 DOWN SOFT 4 of 5 CRITICAL - : rta nan, lost 100%
2015-11-24 18:53:00 hqs1v-dl3com02 DOWN SOFT 3 of 5 CRITICAL - : rta nan, lost 100%
2015-11-24 18:52:15 hqs1v-dl3com02 DOWN SOFT 2 of 5 CRITICAL - : rta nan, lost 100%
2015-11-24 18:51:10 hqs1v-dl3com02 DOWN SOFT 1 of 5 CRITICAL - : rta nan, lost 100%

2015-11-24 02:54:37 achdpapp02 DOWN SOFT 5 of 10 CRITICAL - : rta nan, lost 100%
2015-11-24 02:54:09 achdpapp02 DOWN SOFT 4 of 10 CRITICAL - : rta nan, lost 100%
2015-11-24 02:53:50 achdpapp02 DOWN SOFT 3 of 10 CRITICAL - : rta nan, lost 100%
2015-11-24 02:53:11 achdpapp02 DOWN SOFT 2 of 10 CRITICAL - : rta nan, lost 100%
2015-11-24 02:52:50 achdpapp02 DOWN SOFT 1 of 10 CRITICAL - : rta nan, lost 100%
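
For reference, the gaps between the checks listed above can be worked out with a short Python sketch. The timestamps are copied from the two samples; the function name `retry_gaps` is only illustrative and not part of Nagios.

```python
from datetime import datetime

# Timestamps copied from the two samples above (newest first, as logged).
SAMPLE_HQS1V = [
    "2015-11-24 18:53:28",
    "2015-11-24 18:53:00",
    "2015-11-24 18:52:15",
    "2015-11-24 18:51:10",
]
SAMPLE_ACHDPAPP02 = [
    "2015-11-24 02:54:37",
    "2015-11-24 02:54:09",
    "2015-11-24 02:53:50",
    "2015-11-24 02:53:11",
    "2015-11-24 02:52:50",
]

def retry_gaps(timestamps):
    """Return the seconds between consecutive checks, oldest first."""
    times = sorted(datetime.strptime(t, "%Y-%m-%d %H:%M:%S") for t in timestamps)
    return [int((later - earlier).total_seconds())
            for earlier, later in zip(times, times[1:])]

print(retry_gaps(SAMPLE_HQS1V))       # [65, 45, 28]
print(retry_gaps(SAMPLE_ACHDPAPP02))  # [21, 39, 19, 28]
```

With a 1 minute retry interval, every gap would be expected to be roughly 60 seconds.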

Nagios Environment
Linux Distribution and version - CentOS release 6.6 (Final)
32 or 64bit - 64bit
VMware Image or Manual Install of XI? - VMWare Image
Are there special configurations on your system, ie; is Gnome installed - Nope
Are you using a proxy - Nope
Are you using SSL - Nope

check_host_freshness=0
host_freshness_check_interval=60
host_inter_check_delay_method=s
max_host_check_spread=30

This is a similar issue to: https://support.nagios.com/forum/viewto ... =7&t=22249 but we never saw a resolution.

Thanks.

Re: Host Rechecks faster than retry setting

Posted: Mon Nov 30, 2015 12:09 pm
by rkennedy
This could be a bug, seeing as a solution wasn't found with the past thread. I'd like to gather a bit more information about your system though -

How many hosts / service checks are running on your machine? How many CPUs are allocated to it? What's the load like on it?

As well, please post the output of the following-

top|head -5
Navigate to Admin -> Monitoring Engine Status, and Admin -> System Status - post a screenshot of both pages for us to take a look at.

Re: Host Rechecks faster than retry setting

Posted: Mon Nov 30, 2015 4:54 pm
by lytxnoc
Attached are the screenshots for top, Monitoring Engine status and System Status.

Let me know if you need any further information.

Re: Host Rechecks faster than retry setting

Posted: Tue Dec 01, 2015 2:41 pm
by rkennedy
Nothing looks too crazy above. Can you post a complete definition for one of the hosts as well?

Re: Host Rechecks faster than retry setting

Posted: Tue Dec 01, 2015 6:05 pm
by lytxnoc
Here is the host config for one of the hosts. (Nothing out of the ordinary here.)

Re: Host Rechecks faster than retry setting

Posted: Tue Dec 01, 2015 10:21 pm
by Box293
Can you please post the config for the host template 24x7-linux-server_event, including any other definitions used by this template, such as time periods?

Also, can you find this host object in these files and post the details here please:
/usr/local/nagios/var/objects.cache
/usr/local/nagios/var/retention.dat


If you have a ramdisk implemented, the objects.cache may not be in this location and you'll need to consult /usr/local/nagios/etc/nagios.cfg for its location.
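
If it helps, the host entries can be pulled out of both files with a small Python sketch like the one below. It assumes the usual layouts, where objects.cache uses `define host {` blocks with `host_name` followed by whitespace and retention.dat uses `host {` blocks with `host_name=`. The function name `find_host_blocks` is made up for this example, and the host name is taken from the first post.

```python
import re

def find_host_blocks(text, host_name):
    """Return every host block in text whose host_name matches exactly."""
    # Assumed layouts: "define host {" (objects.cache) or "host {" (retention.dat),
    # each closed by a "}" on a line of its own.
    block_re = re.compile(r"^\s*(?:define\s+)?host\s*\{.*?^\s*\}", re.S | re.M)
    # host_name is separated from its value by whitespace or by "=".
    name_re = re.compile(
        r"^\s*host_name[\s=]+" + re.escape(host_name) + r"\s*$", re.M
    )
    return [m.group(0) for m in block_re.finditer(text)
            if name_re.search(m.group(0))]

if __name__ == "__main__":
    # Paths as given above; adjust them if a ramdisk is in use.
    for path in ("/usr/local/nagios/var/objects.cache",
                 "/usr/local/nagios/var/retention.dat"):
        with open(path) as handle:
            for block in find_host_blocks(handle.read(), "hqs1v-dl3com02"):
                print(path)
                print(block)
```

Blocks such as `hostcomment {` or `define hostgroup {` are skipped because the pattern only accepts `host` directly before the opening brace.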

Re: Host Rechecks faster than retry setting

Posted: Wed Dec 02, 2015 1:59 pm
by lytxnoc
Attached is the server template, retention and cache information.

Re: Host Rechecks faster than retry setting

Posted: Wed Dec 02, 2015 2:01 pm
by lytxnoc
Retention.cache

Re: Host Rechecks faster than retry setting

Posted: Wed Dec 02, 2015 6:35 pm
by tmcdonald
Do you have parent/child relationships set up, or predictive checks enabled?

https://assets.nagios.com/downloads/nag ... ncies.html
https://assets.nagios.com/downloads/nag ... ility.html
https://assets.nagios.com/downloads/nag ... hecks.html

Either of those might be causing things to be checked more often than they are configured to.
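
One way to check the predictive settings is to read them out of nagios.cfg. Below is a minimal Python sketch, assuming the standard Nagios Core directive names for predictive dependency checks and the default config path mentioned earlier in this thread. The function name `read_directives` is hypothetical.

```python
# Directive names assumed from standard Nagios Core main config files.
PREDICTIVE_DIRECTIVES = (
    "enable_predictive_host_dependency_checks",
    "enable_predictive_service_dependency_checks",
)

def read_directives(cfg_text, wanted=PREDICTIVE_DIRECTIVES):
    """Return {directive: value} for the wanted key=value lines in cfg_text."""
    found = {}
    for line in cfg_text.splitlines():
        line = line.strip()
        # Skip blank lines, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key.strip() in wanted:
            found[key.strip()] = value.strip()
    return found

if __name__ == "__main__":
    # Default path; it may differ on your install.
    with open("/usr/local/nagios/etc/nagios.cfg") as handle:
        print(read_directives(handle.read()))
```

A value of 1 means the corresponding predictive check is enabled. A directive missing from the output is simply not set in the file.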

Re: Host Rechecks faster than retry setting

Posted: Fri Dec 04, 2015 10:59 am
by lytxnoc
We do have Service Dependency checks set up. I'll remove the dependency checks and post the results after the next issue.