Page 1 of 1

Linux Server -Service status Delay

Posted: Fri Feb 15, 2013 3:11 am
by sathiyaraj
Hi,

I have monitor my linux server through Nagios XI. when i shut down my server, service status ( Nagios XI )its not immediately take effect.

Note:
i edited Monitoring Settings also ( as 1 min )

Thanks in advance

Re: Linux Server -Service status Delay

Posted: Fri Feb 15, 2013 10:22 am
by abrist
How immediate? If the host alive checks are set to one minute, XI should register the critical within 1 minute of the server shutting down. The details page in XI could take up to another 30 seconds or so to update the information. This is the expected behavior, does your experience differ?

Re: Linux Server -Service status Delay

Posted: Sat Feb 16, 2013 12:20 am
by sathiyaraj
Host status is immediately down. but services taking so late to update.

Re: Linux Server -Service status Delay

Posted: Mon Feb 18, 2013 8:03 am
by scottwilkerson
Did you edit the check_interval for all of the services too or just the host?

Re: Linux Server -Service status Delay

Posted: Tue Feb 26, 2013 5:21 am
by sathiyaraj
Check Interval in Nagios Server Right..? Its defines 1 mins..

Re: Linux Server -Service status Delay

Posted: Tue Feb 26, 2013 10:39 am
by sreinhardt
What are the check_interval is defined independenly for the service and host checks. Along with the service's check_interval, what is your retry_interval and max_check_attempts. The math behind this would be [inital check_interval time]+(retry_interval * max_check_attempts). Finally how long is it taking for a service check to fail after the host is down?

Re: Linux Server -Service status Delay

Posted: Wed Feb 27, 2013 1:00 pm
by sathiyaraj
I defined 1 minutes interval..Is there any option to check with in seconds..? But my case, server status down after 5 mins only..

Re: Linux Server -Service status Delay

Posted: Wed Feb 27, 2013 2:00 pm
by abrist
From: http://nagios.sourceforge.net/docs/3_0/configmain.html
Timing Interval Length

Format: interval_length=<seconds>
Example: interval_length=60

This is the number of seconds per "unit interval" used for timing in the scheduling queue, re-notifications, etc. "Units intervals" are used in the object configuration file to determine how often to run a service check, how often to re-notify a contact, etc.

Important: The default value for this is set to 60, which means that a "unit value" of 1 in the object configuration file will mean 60 seconds (1 minute). I have not really tested other values for this variable, so proceed at your own risk if you decide to do so!
So you can change the interval_length to 1 in your nagios.cfg file and checks intervals will be measured in seconds (remember to change your check intervals!). A warning though, you can very easily overload your box and the ones you are checking if you do not handle this changed setting with care. I would highly suggest that you consider long and hard about doing checks more frequently than 15 or 20 seconds. And as you add more checks, this will become a bigger problem with nagios server's load.