Nagios Checks were not happening - system time change
Posted: Thu Aug 28, 2014 5:48 am
Dear All,
Today I found a strange issue in nagios.
Checks were not happening in nagios for 30 minutes
when i checked the nagios.log
following logs i found ... Please help me in finding out the root cause..
I found that checks were not happening and after some time ( 10 minutes) i found the below line in nagios.log
[1409219057] Warning: A system time change of 0d 0h 16m 56s (forwards in time) has been detected. Compensating...
I didnt do any sort of changes in the server..i didnt even login
after this i found countinous logs like
[Thu Aug 28 15:20:36 2014] Max concurrent service checks (3000) has been reached. Nudging Ranchi by 10 seconds...
[Thu Aug 28 15:20:36 2014] Max concurrent service checks (3000) has been reached. Nudging Raipur by 10 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging Vapi by 5 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging Indore by 13 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging Plant by 6 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging DNS Server by 5 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging Ghaziabad:Environment by 7 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging Noida by 6 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging Lucknow by 13 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging FARIDABAD:Uptime by 11 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging Raipur by 9 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging Mahindra-REVA Link by 9 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging Goregaon:Environment by 9 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging :Environment by 10 seconds...
[Thu Aug 28 15:20:40 2014] Max concurrent service checks (3000) has been reached. Nudging Ghaziabad:Memory Used by 8 seconds...
All these (Raipur,Vapi,Indore,Plant server, etcc..,,)were network devices and servers
For all the checks after that were logged like this...
after half an hour...
the issue got resolved automatically without doing anything and i found this log after ( 5 -7 minutes )
[1409221391] Auto-save of retention data completed successfully.
Please help me in getting this issue resolved...
If this is repeating...then i will be in trouble.
Thanks in advance...
Regards,
Ashok.
Today I found a strange issue in nagios.
Checks were not happening in nagios for 30 minutes
when i checked the nagios.log
following logs i found ... Please help me in finding out the root cause..
I found that checks were not happening and after some time ( 10 minutes) i found the below line in nagios.log
[1409219057] Warning: A system time change of 0d 0h 16m 56s (forwards in time) has been detected. Compensating...
I didnt do any sort of changes in the server..i didnt even login
after this i found countinous logs like
[Thu Aug 28 15:20:36 2014] Max concurrent service checks (3000) has been reached. Nudging Ranchi by 10 seconds...
[Thu Aug 28 15:20:36 2014] Max concurrent service checks (3000) has been reached. Nudging Raipur by 10 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging Vapi by 5 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging Indore by 13 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging Plant by 6 seconds...
[Thu Aug 28 15:20:37 2014] Max concurrent service checks (3000) has been reached. Nudging DNS Server by 5 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging Ghaziabad:Environment by 7 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging Noida by 6 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging Lucknow by 13 seconds...
[Thu Aug 28 15:20:38 2014] Max concurrent service checks (3000) has been reached. Nudging FARIDABAD:Uptime by 11 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging Raipur by 9 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging Mahindra-REVA Link by 9 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging Goregaon:Environment by 9 seconds...
[Thu Aug 28 15:20:39 2014] Max concurrent service checks (3000) has been reached. Nudging :Environment by 10 seconds...
[Thu Aug 28 15:20:40 2014] Max concurrent service checks (3000) has been reached. Nudging Ghaziabad:Memory Used by 8 seconds...
All these (Raipur,Vapi,Indore,Plant server, etcc..,,)were network devices and servers
For all the checks after that were logged like this...
after half an hour...
the issue got resolved automatically without doing anything and i found this log after ( 5 -7 minutes )
[1409221391] Auto-save of retention data completed successfully.
Please help me in getting this issue resolved...
If this is repeating...then i will be in trouble.
Thanks in advance...
Regards,
Ashok.