
Active checks stuck every day at same time

Posted: Fri Mar 02, 2012 3:12 am
by pdusza
Hi,

For about three days I have had a strange issue with Nagios XI. At about 1 am almost all active checks and performance graphs get stuck. What is really interesting is that the 'Next check' date is set to midnight on another day. The only thing I can do is schedule an immediate check on all hosts and services.
Some hosts/services are OK (updated and checked), but most of them are just stuck. I checked the NPCD logs and the Nagios logs and I don't see any errors there; the NPCD daemon is running. There are no differences between the host/service definitions of those that are stuck and those that are running (same templates).

I have no idea what could be wrong.

Please look at the screenshots:
Capture1.PNG
Capture3.PNG
Capture4.PNG

Re: Active checks stuck every day at same time

Posted: Fri Mar 02, 2012 11:16 am
by mguthrie
That is pretty odd. Can you show us the output from the following command?

Code:

chage -l nagios

First, let's make sure we don't have multiple instances of Nagios that have spawned. Run the following from the command line:

Code:

service nagios stop
killall -9 nagios
service nagios start
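
To see whether duplicate daemons are actually present before or after the restart, something like the following might help. It is only a rough sketch: the helper name `count_top_level_nagios` is made up for illustration, and it assumes the main Nagios daemon is re-parented to PID 1 while its check workers are children of that daemon, which may not hold on every system.

```shell
# Hypothetical helper: reads "PID PPID COMMAND" lines on stdin and prints
# how many processes named "nagios" have PID 1 as their parent.
# Assumption: each separately started daemon is a direct child of init,
# while forked check workers are children of the daemon itself.
count_top_level_nagios() {
    awk '$2 == 1 && $3 == "nagios" { n++ } END { print n + 0 }'
}
```

It could be fed with `ps -eo pid=,ppid=,comm= | count_top_level_nagios`; a result greater than 1 would suggest more than one instance is running.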

Then let's check to make sure there isn't any DB corruption:
http://assets.nagios.com/downloads/nagi ... tabase.pdf

Re: Active checks stuck every day at same time

Posted: Fri Mar 02, 2012 4:41 pm
by pdusza
Hi,

Here is the output of those commands:
Capture5.PNG
and here is the output of the repair process (no errors):
result.txt