Performance Data Not Working
Posted: Fri May 02, 2014 12:48 pm
I've tried sifting through former posts on performance data issues without any real luck. The problem appeared after a two-phase change to our Nagios XI environment:
1. We needed to move our system off a physical server that requires decommissioning.
2. We wanted to upgrade to 2012 R2.9 to stay current.
The new server is a VM on ESXi. The former version of the software was 2012 R1.6 (so quite a jump). The first step was to build the VM on R1.6, then restore our latest backup to this server. Once complete, I logged in and verified things looked good. The second step was to download R2.9 and run the upgrade script. This completed without error. One item that stood out right away was that the performance graphs weren't working. This was apparent because many of our networking checks were complaining about the mrtg .rrd files not being present. I grabbed the files from the old server and populated them on the new one. These errors cleared, but we are still not getting any new performance data.
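For anyone checking the same symptom, one rough way to confirm that the copied .rrd files are not being updated is to list those that have not been modified recently. This is only a sketch: the function name `stale_rrds` and the 15-minute default are my own assumptions, and the directory is passed as an argument because the right location depends on the install.

```shell
#!/bin/sh
# Hypothetical helper: list .rrd files under a directory that have NOT been
# modified within the last N minutes (default 15, an arbitrary choice).
stale_rrds() {
    dir=$1
    minutes=${2:-15}
    # -mmin +N matches files whose last modification is more than N minutes old.
    find "$dir" -type f -name '*.rrd' -mmin +"$minutes" | sort
}
```

Calling `stale_rrds <directory> 15` on the directory that holds the .rrd files prints every file that has gone more than 15 minutes without a write. If every file shows up, nothing is feeding the graphs.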
Something interesting: the latest timestamp in the perfdata.log output below is from the day before yesterday. Nothing further has been written to that log since the migration.
At this point I'm just grasping at straws, so any and all help is greatly appreciated. Thanks!
Hopefully this information will prove helpful:
Code:
[root@gwr-noc /]# tail -15 /usr/local/nagios/var/npcd.log
[05-02-2014 12:02:45] NPCD: npcd Daemon (0.4.14) started with PID=18461
[05-02-2014 12:02:45] NPCD: Please have a look at 'npcd -V' to get license information
[05-02-2014 12:02:45] NPCD: HINT: load_threshold is enabled - ('20.000000')
[05-02-2014 12:08:11] NPCD: Caught Termination Signal - Hasta la vista... baby
[05-02-2014 12:08:11] NPCD: npcd Daemon (0.4.14) started with PID=27269
[05-02-2014 12:08:11] NPCD: Please have a look at 'npcd -V' to get license information
[05-02-2014 12:08:11] NPCD: HINT: load_threshold is enabled - ('20.000000')
[05-02-2014 12:19:07] NPCD: Caught Termination Signal - Hasta la vista... baby
[05-02-2014 12:19:07] NPCD: npcd Daemon (0.4.14) started with PID=25768
[05-02-2014 12:19:07] NPCD: Please have a look at 'npcd -V' to get license information
[05-02-2014 12:19:07] NPCD: HINT: load_threshold is enabled - ('20.000000')
[05-02-2014 12:27:23] NPCD: Caught Termination Signal - Hasta la vista... baby
[05-02-2014 12:28:06] NPCD: npcd Daemon (0.4.14) started with PID=3144
[05-02-2014 12:28:06] NPCD: Please have a look at 'npcd -V' to get license information
[05-02-2014 12:28:06] NPCD: HINT: load_threshold is enabled - ('20.000000')
Code:
[root@gwr-noc /]# tail -15 /usr/local/nagios/var/perfdata.log
2014-04-29 17:00:35 [29373] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1398808808.perfdata.host-PID-29373 deleted
2014-04-29 17:00:35 [29373] [0] *** Timeout while processing Host: "WB8000_DELLS_3" Service: "_HOST_"
2014-04-29 17:00:35 [29373] [0] *** process_perfdata.pl terminated on signal ALRM
2014-04-29 17:40:35 [26051] [0] *** TIMEOUT: Timeout after 12 secs. ***
2014-04-29 17:40:35 [26051] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2014-04-29 17:40:35 [26051] [0] *** TIMEOUT: Please check your npcd.cfg
2014-04-29 17:40:35 [26051] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1398811208.perfdata.host-PID-26051 deleted
2014-04-29 17:40:35 [26051] [0] *** Timeout while processing Host: "WI-FILER-IMM" Service: "_HOST_"
2014-04-29 17:40:35 [26051] [0] *** process_perfdata.pl terminated on signal ALRM
2014-04-30 13:05:31 [27739] [0] *** TIMEOUT: Timeout after 12 secs. ***
2014-04-30 13:05:31 [27739] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2014-04-30 13:05:31 [27739] [0] *** TIMEOUT: Please check your npcd.cfg
2014-04-30 13:05:31 [27739] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1398881110.perfdata.host-PID-27739 deleted
2014-04-30 13:05:31 [27739] [0] *** Timeout while processing Host: "WI-SW5" Service: "_HOST_"
2014-04-30 13:05:31 [27739] [0] *** process_perfdata.pl terminated on signal ALRM
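To see how often these timeouts occur and when the log was last written, a small sketch along these lines may help. It assumes the line format shown in the excerpt above (a `YYYY-MM-DD` date as the first field); the function names are purely illustrative.

```shell
#!/bin/sh
# Hypothetical helper: count "Timeout after" entries per calendar day.
# Assumes each log line begins with a YYYY-MM-DD date, as in the excerpt above.
timeouts_per_day() {
    grep 'TIMEOUT: Timeout after' "$1" |
        awk '{ count[$1]++ } END { for (d in count) print d, count[d] }' |
        sort
}

# Hypothetical helper: print the date (first field) of the newest log entry.
last_entry_date() {
    tail -n 1 "$1" | awk '{ print $1 }'
}
```

Both take the log path as their only argument, for example `timeouts_per_day /usr/local/nagios/var/perfdata.log`. Comparing the result of `last_entry_date` with today's date shows whether process_perfdata.pl has logged anything since the move.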
Code:
Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/VolGroup00-LogVol00
                      246G  6.7G  227G   3% /
tmpfs                 2.9G     0  2.9G   0% /dev/shm
/dev/sda1              97M   28M   65M  31% /boot
Code:
[root@gwr-noc /]# ls /usr/local/nagios/var/spool/xidpe | wc -l
0
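The same count can be taken for more than one spool directory at once, which shows whether perfdata files are being written at all or are piling up unprocessed. A minimal sketch, assuming the two spool paths that appear in this post; the function name `spool_count` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print "<directory> <number of regular files>" for each
# directory given, or "<directory> missing" if it does not exist.
spool_count() {
    for dir in "$@"; do
        if [ -d "$dir" ]; then
            n=$(find "$dir" -maxdepth 1 -type f | wc -l | tr -d ' ')
            echo "$dir $n"
        else
            echo "$dir missing"
        fi
    done
}
```

For example, `spool_count /usr/local/nagios/var/spool/perfdata /usr/local/nagios/var/spool/xidpe`. A count of 0 in both could mean either that files are processed immediately or that none are being produced, so it is worth running a few times in a row.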
Dan