Perfdata not working anymore
Posted: Fri Nov 15, 2013 3:31 am
Hello,
After my issue after the reboot yesterday, it seems our graphs don't get any data anymore!
tail -25 /usr/local/nagios/var/perfdata.log
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: Please check your npcd.cfg
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: Please check your npcd.cfg
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: Please check your npcd.cfg
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1384408803.perfdata.service-PID-23456 deleted
2013-11-14 07:00:32 [23456] [0] *** Timeout while processing Host: "hostname" Service: "SRV_Ping"
2013-11-14 07:00:32 [23456] [0] *** process_perfdata.pl terminated on signal ALRM
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1384408803.perfdata.host-PID-23455 deleted
2013-11-14 07:00:32 [23455] [0] *** Timeout while processing Host: "hostname" Service: "_HOST_"
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1384408818.perfdata.service-PID-23459 deleted
2013-11-14 07:00:32 [23455] [0] *** process_perfdata.pl terminated on signal ALRM
2013-11-14 07:00:32 [23459] [0] *** Timeout while processing Host: "hostname" Service: "SRV_CPU"
2013-11-14 07:00:32 [23459] [0] *** process_perfdata.pl terminated on signal ALRM
There seems to be no activity after 07:00...
ls /usr/local/nagios/var/spool/checkresults | wc -l
9
ls /usr/local/nagios/var/spool/perfdata | wc -l
5907
ls /usr/local/nagios/var/spool/xidpe | wc -l
0
Any help is welcome!
Already did a restart of npcd
service npcd restart
[root@nagios perfdata]# df -i
Filesystem Inodes IUsed IFree IUse% Mounted on
/dev/mapper/VolGroup00-LogVol00
1286144 111734 1174410 9% /
tmpfs 490139 1 490138 1% /dev/shm
/dev/sda1 25688 39 25649 1% /boot
cifs:/root_vdm_2/cifs402/Backups/SRVNAGIOS01
257949694 3859915 254089779 2% /var/Digipolis/Backup
[root@nagios perfdata]# df -h
Filesystem Size Used Avail Use% Mounted on
/dev/mapper/VolGroup00-LogVol00
20G 7.4G 11G 41% /
tmpfs 1.9G 0 1.9G 0% /dev/shm
/dev/sda1 97M 28M 65M 31% /boot
cifs:/root_vdm_2/cifs402/Backups/SRVNAGIOS01
4.9T 2.9T 2.0T 60% /var/Digipolis/Backup
cat /usr/local/nagios/etc/nagios.cfg | grep check_result
check_result_path=/usr/local/nagios/var/spool/checkresults
check_result_reaper_frequency=10
max_check_result_file_age=3600
max_check_result_reaper_time=30
nano /usr/local/nagios/etc/pnp/process_perfdata.cfg
Changed TIMEOUT to 10 and restarted NPCD service.
Tried to gather as much info as I could find.
After my issue after the reboot yesterday, it seems our graphs don't get any data anymore!
tail -25 /usr/local/nagios/var/perfdata.log
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: Please check your npcd.cfg
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: Please check your npcd.cfg
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: Please check your npcd.cfg
2013-11-14 07:00:32 [23456] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1384408803.perfdata.service-PID-23456 deleted
2013-11-14 07:00:32 [23456] [0] *** Timeout while processing Host: "hostname" Service: "SRV_Ping"
2013-11-14 07:00:32 [23456] [0] *** process_perfdata.pl terminated on signal ALRM
2013-11-14 07:00:32 [23455] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1384408803.perfdata.host-PID-23455 deleted
2013-11-14 07:00:32 [23455] [0] *** Timeout while processing Host: "hostname" Service: "_HOST_"
2013-11-14 07:00:32 [23459] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1384408818.perfdata.service-PID-23459 deleted
2013-11-14 07:00:32 [23455] [0] *** process_perfdata.pl terminated on signal ALRM
2013-11-14 07:00:32 [23459] [0] *** Timeout while processing Host: "hostname" Service: "SRV_CPU"
2013-11-14 07:00:32 [23459] [0] *** process_perfdata.pl terminated on signal ALRM
There seems to be no activity after 07:00...
ls /usr/local/nagios/var/spool/checkresults | wc -l
9
ls /usr/local/nagios/var/spool/perfdata | wc -l
5907
ls /usr/local/nagios/var/spool/xidpe | wc -l
0
Any help is welcome!
Already did a restart of npcd
service npcd restart
[root@nagios perfdata]# df -i
Filesystem Inodes IUsed IFree IUse% Mounted on
/dev/mapper/VolGroup00-LogVol00
1286144 111734 1174410 9% /
tmpfs 490139 1 490138 1% /dev/shm
/dev/sda1 25688 39 25649 1% /boot
cifs:/root_vdm_2/cifs402/Backups/SRVNAGIOS01
257949694 3859915 254089779 2% /var/Digipolis/Backup
[root@nagios perfdata]# df -h
Filesystem Size Used Avail Use% Mounted on
/dev/mapper/VolGroup00-LogVol00
20G 7.4G 11G 41% /
tmpfs 1.9G 0 1.9G 0% /dev/shm
/dev/sda1 97M 28M 65M 31% /boot
cifs:/root_vdm_2/cifs402/Backups/SRVNAGIOS01
4.9T 2.9T 2.0T 60% /var/Digipolis/Backup
cat /usr/local/nagios/etc/nagios.cfg | grep check_result
check_result_path=/usr/local/nagios/var/spool/checkresults
check_result_reaper_frequency=10
max_check_result_file_age=3600
max_check_result_reaper_time=30
nano /usr/local/nagios/etc/pnp/process_perfdata.cfg
Changed TIMEOUT to 10 and restarted NPCD service.
Tried to gather as much info as I could find.