
Service checks keep postponing

Posted: Tue Jun 09, 2015 11:03 am
by rkymtnhigh
I am running Nagios XI 2014R1.4 on CentOS 6

I changed the check interval for a select group of services across different hosts and have messed something up.
After changing the intervals, the checks are no longer running.
When I watch the Next Check field on the service's detail page, the time keeps advancing by the check interval, but the check never runs.
I've tried deleting and setting up the services, but the check stays pending. I can force an immediate check, which works fine, but it never runs again.

I get the feeling I am just missing something for these services I have messed with. I would love to avoid starting over!

Thanks in advance for any and all help.

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 12:50 pm
by jdalrymple
rkymtnhigh wrote: After changing the intervals, the checks are no longer running.

...

I've tried deleting and setting up the services, but the check stays pending. I can force an immediate check, which works fine, but it never runs again.
How do you know the checks are not running? What does the "Last Check" field say in Service Status Detail?

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 1:41 pm
by rkymtnhigh
I know they are not running because there is no performance graph info, system event info and the last check time does not update.
Also, if I set up a new check, it will simply stay in PENDING.

The last check field displays the last time the check was manually run for testing.
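One way to see how widespread the problem is would be to scan the status file for services whose last check is older than expected. The following is only a rough sketch: it assumes the status file uses `servicestatus { key=value ... }` blocks with `host_name`, `service_description` and `last_check` fields, and the function names are made up for illustration.

```python
import re

# Matches one "servicestatus { ... }" block and captures its body.
BLOCK_RE = re.compile(r"servicestatus\s*\{(.*?)\n\s*\}", re.DOTALL)


def parse_service_status(text):
    """Return one dict of key=value pairs per servicestatus block."""
    services = []
    for match in BLOCK_RE.finditer(text):
        entry = {}
        for line in match.group(1).splitlines():
            key, sep, value = line.strip().partition("=")
            if sep:
                entry[key] = value
        services.append(entry)
    return services


def stale_services(services, now, max_age_seconds):
    """List (host, service) pairs whose last_check is older than max_age_seconds.

    A service that has never been checked (last_check missing or 0) is
    reported as stale too, which covers checks stuck in PENDING.
    """
    stale = []
    for svc in services:
        last_check = int(svc.get("last_check", "0"))
        if now - last_check > max_age_seconds:
            stale.append((svc.get("host_name"), svc.get("service_description")))
    return stale
```

To try it, read the status file into a string (on many installs it lives at /usr/local/nagios/var/status.dat, but that path is an assumption here), pass the result of `parse_service_status` to `stale_services` together with `int(time.time())`, and pick a threshold of a few check intervals.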

Thanks for the help.

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 1:52 pm
by tgriep
Could you go into Core Config Manager, select one of the hosts or services that is failing, and post its configuration?

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 1:58 pm
by rkymtnhigh
Sure.

define service {
    host_name               sis01-cert
    service_description     svc_puppet
    use                     critical-service
    check_command           check_xi_service_nsclient!!SERVICESTATE!-l pe-puppet -d SHOWALL!!!!!
    initial_state           c
    max_check_attempts      2
    check_interval          5
    retry_interval          2
    active_checks_enabled   1
    passive_checks_enabled  0
    check_period            24x7
    flap_detection_enabled  1
    notification_interval   15
    notification_period     24x7
    notification_options    c,r,
    notifications_enabled   1
    contacts                nabops,pagerduty
    contact_groups          NAB-OnCall,NAB-Ops
    register                1
}
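
For reference, `check_interval` and `retry_interval` are counted in time units, and one unit is `interval_length` seconds (60 unless changed in nagios.cfg). Below is a minimal sketch that turns a definition like the one above into seconds. It assumes the default `interval_length`, and the helper names are hypothetical.

```python
def parse_definition(text):
    """Parse 'directive value' lines from an object definition into a dict."""
    directives = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, the opening "define ..." line, the closing brace, comments.
        if not line or line.startswith(("define", "}", "#", ";")):
            continue
        parts = line.split(None, 1)  # split on the first run of whitespace
        if len(parts) == 2:
            directives[parts[0]] = parts[1].strip()
    return directives


def interval_seconds(directives, name, interval_length=60):
    """Convert an interval directive from time units to seconds."""
    return float(directives[name]) * interval_length
```

With the definition above, `check_interval 5` works out to 300 seconds and `retry_interval 2` to 120 seconds under that assumption, so the Next Check time moving in 5-minute steps matches the configured interval.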

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 2:05 pm
by jdalrymple
1 check or all checks?

What do things look like in your Nagios Core interface? Sometimes it can be a bit more illuminating. Can you show us a screenshot of the service details from the Core interface?

http://<mynagiosserverip>/nagios

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 2:41 pm
by rkymtnhigh
^ I changed the check_interval setting on a bunch of services, and those *seem* to be the ones having trouble now. So it's spread across most hosts, some here, some there.

Here is the requested screenshot of the service details in core for a service that hasn't run since the day I changed the setting.

Thanks again.

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 2:45 pm
by jdalrymple
That is strange. If it says it's scheduled, then it certainly should run. Are you seeing anything in nagios.log? If not, let's enable debugging and look there...

nagios.cfg:

Code: Select all

debug_file=/usr/local/nagios/var/nagios.debug
debug_level=-1
debug_verbosity=2
max_debug_file_size=10000000
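
To double-check that the file Nagios loads really contains these values, something like the following Python sketch could compare the debug directives against what is expected. It only assumes that nagios.cfg uses `key=value` lines with `#` comments; the function names are invented for this example.

```python
def parse_main_config(text):
    """Parse key=value lines, skipping blanks and # comments.

    Directives that may repeat (cfg_file, cfg_dir, ...) keep only their last
    value here, which is fine for the single-valued debug settings.
    """
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            settings[key.strip()] = value.strip()
    return settings


def debug_mismatches(settings, expected):
    """Return {directive: actual value} for each directive that differs."""
    return {
        key: settings.get(key)
        for key, wanted in expected.items()
        if settings.get(key) != wanted
    }
```

Calling `debug_mismatches(parse_main_config(text), {"debug_level": "-1", "debug_verbosity": "2"})` on the contents of nagios.cfg returns an empty dict when both values match, and shows the actual value (or `None` if the directive is missing) otherwise.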

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 3:17 pm
by rkymtnhigh
Ok, I've got that set up.
A tail of that log shows the following:
Capture1.PNG

Re: Service checks keep postponing

Posted: Tue Jun 09, 2015 3:37 pm
by jdalrymple
You'll note the verbosity of that debug log is ridiculous. Your screenshot covers less than 1 millisecond of results. You will have to do a little interpretation yourself and pull out the pertinent portions to analyze.
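
A small filter can help cut the debug log down to the lines about one service. This is just a sketch: it makes no assumption about the log's line format and simply keeps lines that mention every keyword given. The function name is made up for illustration.

```python
def matching_lines(lines, keywords):
    """Yield lines that contain every keyword, ignoring case."""
    lowered = [keyword.lower() for keyword in keywords]
    for line in lines:
        text = line.lower()
        if all(keyword in text for keyword in lowered):
            yield line.rstrip("\n")
```

For example, opening /usr/local/nagios/var/nagios.debug and passing the file object with `["sis01-cert", "svc_puppet"]` as keywords would print only the entries that name both that host and that service, which is much easier to read than a raw tail.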