Page 1 of 1

Performance data not collected since upgrade to 2012R2.8c

Posted: Sun Feb 02, 2014 6:27 pm
by genmais
I have recently installed Nagios XI using the VMware image including 2012R2.7. Now I have upgraded the VM to 2012R2.8c using source installer.
Upgrade finished successfully according to the upgrade script's output.
However, since the upgrade was done no performance data has been collected. All files in /usr/local/nagios/share/perfdata haven't been touched since the upgrade. Neither the file /usr/local/nagios/var/npcd.log hasn't been touched.
perfdata.log shows a couple of timeouts:
[root@nagiosxi perfdata]# tail -25 /usr/local/nagios/var/perfdata.log
2014-01-29 00:14:00 [10245] [0] *** TIMEOUT: Timeout after 5 secs. ***
2014-01-29 00:14:00 [10245] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2014-01-29 00:14:00 [10245] [0] *** TIMEOUT: Please check your npcd.cfg
2014-01-29 00:14:00 [10245] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1390913398.perfdata.service-PID-10245 deleted
2014-01-29 00:14:00 [10245] [0] *** Timeout while processing Host: "" Service: ""
2014-01-29 00:14:00 [10245] [0] *** process_perfdata.pl terminated on signal ALRM
2014-01-29 00:14:00 [10243] [0] *** TIMEOUT: Timeout after 5 secs. ***
2014-01-29 00:14:00 [10243] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2014-01-29 00:14:00 [10243] [0] *** TIMEOUT: Please check your npcd.cfg
2014-01-29 00:14:00 [10243] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1390913413.perfdata.service-PID-10243 deleted
2014-01-29 00:14:00 [10243] [0] *** Timeout while processing Host: "localhost" Service: "Swap_Usage"
2014-01-29 00:14:00 [10243] [0] *** process_perfdata.pl terminated on signal ALRM
Right after the upgrade performance graphs nicely showed the old performance data. However, graphs became shorter and shorter and when 24h were over since the upgrade all graphs a blank.

Any ideas how to reinvent collecting performance data? Thanks!

Re: Performance data not collected since upgrade to 2012R2.8

Posted: Mon Feb 03, 2014 11:18 am
by abrist
First, lets check to see if npcd is running and restart it:

Code: Select all

service npcd status
service npcd stop
ps -aef | grep npcd
killall npcd
service npcd start
Next, lets check the perfdata logs:

Code: Select all

tail -20 /usr/local/nagios/var/perfdata.log
tail -20 /usr/local/nagios/var/npcd.log
Lets make sure cron is running:

Code: Select all

service crond status
And finally, lets check the size of the spool directories:

Code: Select all

ls /usr/local/nagios/var/spool/xidpe | wc -l
ls /usr/local/nagios/var/spool/perfdata | wc -l
ls /usr/local/nagios/var/spool/checkresults | wc -l

Re: Performance data not collected since upgrade to 2012R2.8

Posted: Mon Feb 03, 2014 12:58 pm
by genmais
Meanwhile I found a hint in another thread which apparently solves the issue:
service npcd restart
Haven't thought that it's that simple. However, after some period of heavy load on the system the performance graphs are back.

What still confuses me a bit is that I have rebooted the machine after the upgrade. Just to avoid strange things. However, npcd seemed not to come up properly, manual restart was necessary anyway. Maybe any raise condition during startup that confused npcd...

Anyway, thanks for your support.

Re: Performance data not collected since upgrade to 2012R2.8

Posted: Mon Feb 03, 2014 1:22 pm
by abrist
No problem. It could be a race condition, let us know if the issues is frequent.