Hey all
Nagios XI 5.4.10 VMWare Appliance.
I'm having an issue with graphing in XI.
When I set log level to 2 and attempted to start/run /usr/local/nagios/libexec/process_perfdata.pl the log file shows the following:
2017-08-02 18:42:55 [6493] [0] *** TIMEOUT: Please check your npcd.cfg
2017-08-02 18:42:55 [6494] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//host-perfdata.1501699323-PID-6494 deleted
2017-08-02 18:42:55 [6494] [0] *** Timeout while processing Host: "flca29.stpete.honeywell.com" Service: "_HOST_"
2017-08-02 18:42:55 [6493] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//service-perfdata.1501699324-PID-6493 deleted
2017-08-02 18:42:55 [6494] [0] *** process_perfdata.pl terminated on signal ALRM
2017-08-02 18:42:55 [6493] [0] *** Timeout while processing Host: "az18u8702.dcw.honeywell.com" Service: "CPU_Usage"
2017-08-02 18:42:55 [6493] [0] *** process_perfdata.pl terminated on signal ALRM
2017-10-31 02:49:05 [5439] [2] Using Config File /usr/local/nagios/etc/pnp/process_perfdata.cfg parameters
2017-10-31 02:49:05 [5439] [1] process_perfdata.pl-0.6.11 starting in SYNC Mode
2017-10-31 02:49:05 [5439] [1] Cant find Nagios Environment. Exiting ....
As you can see there hasn't been graphing data for a bit.
Here is some additional data:
[root@az18u1836 ~]# ls -l /usr/local/nagios/var/spool
total 12
drwxrwsr-x 2 nagios nagcmd 4096 Oct 31 03:15 checkresults
drwxr-xr-x 2 nagios nagios 4096 Aug 15 18:00 perfdata
drwxr-xr-x 2 nagios nagios 4096 Oct 28 11:47 xidpe
[root@az18u1836 ~]# ls -l /usr/local/nagios/var/
total 388892
drwxrwxr-x 2 nagios nagios 20480 Oct 30 23:59 archives
-rw-r--r-- 1 nagios nagios 126840319 Oct 31 03:15 host-perfdata
-rw-r--r-- 1 nagios nagios 40855 Oct 31 03:15 livestatus.log
-rw-r--r-- 1 nagios nagios 367 Oct 31 03:15 nagios.configtest
-rw-r--r-- 1 nagios nagios 6 Oct 31 03:15 nagios.lock
-rw-r--r-- 1 nagios nagios 2621671 Oct 31 03:15 nagios.log
-rw-r--r-- 1 nagios nagios 5 Sep 27 15:49 ndo2db.lock
-rw-r--r-- 1 nagios nagios 0 Oct 31 03:15 ndomod.tmp
srwxr-xr-x 1 nagios nagios 0 Sep 27 15:49 ndo.sock
-rw-r--r-- 1 nagios nagios 11029 Oct 31 03:13 npcd.log
-rw-r--r-- 1 nagios nagios 3217349 Oct 31 03:15 objects.cache
-rw-r--r-- 1 nagios nagios 3217349 Oct 31 03:15 objects.precache
-rw-rw-r-- 1 nagios nagios 1623 Oct 31 03:09 perfdata.log
-rw------- 1 nagios nagios 5416155 Oct 31 03:15 retention.dat
drwxrwsr-x 2 nagios nagcmd 4096 Oct 31 03:15 rw
-rw-r--r-- 1 nagios nagios 251394697 Oct 31 03:15 service-perfdata
drwxr-xr-x 5 nagios nagios 4096 Oct 4 2016 spool
drwxr-xr-x 2 nagios nagios 4096 Aug 15 18:00 stats
-rw-rw-r-- 1 nagios nagios 5388086 Oct 31 03:15 status.dat
[root@az18u1836 ~]# grep perfdata /usr/local/nagios/etc/nagios.cfg
service_perfdata_file=/usr/local/nagios/var/service-perfdata
service_perfdata_file_template=DATATYPE::SERVICEPERFDATA\tTIMET::$TIMET$\tHOSTNAME::$HOSTNAME$\tSERVICEDESC::$SERVICEDESC$\tSERVICEPERFDATA::$SERVICEPERFDATA$\tSERVICECHECKCOMMAND::$SERVICECHECKCOMMAND$\tHOSTSTATE::$HOSTSTATE$\tHOSTSTATETYPE::$HOSTSTATETYPE$\tSERVICESTATE::$SERVICESTATE$\tSERVICESTATETYPE::$SERVICESTATETYPE$\tSERVICEOUTPUT::$SERVICEOUTPUT$\tLONGSERVICEOUTPUT::$LONGSERVICEOUTPUT$
service_perfdata_file_mode=a
service_perfdata_file_processing_interval=86400
service_perfdata_file_processing_command=process-service-perfdata-file-bulk
host_perfdata_file=/usr/local/nagios/var/host-perfdata
host_perfdata_file_template=DATATYPE::HOSTPERFDATA\tTIMET::$TIMET$\tHOSTNAME::$HOSTNAME$\tHOSTPERFDATA::$HOSTPERFDATA$\tHOSTCHECKCOMMAND::$HOSTCHECKCOMMAND$\tHOSTSTATE::$HOSTSTATE$\tHOSTSTATETYPE::$HOSTSTATETYPE$\tHOSTOUTPUT::$HOSTOUTPUT$\tLONGHOSTOUTPUT::$LONGHOSTOUTPUT$
host_perfdata_file_mode=a
host_perfdata_file_processing_interval=86400
host_perfdata_file_processing_command=process-host-perfdata-file-bulk
perfdata_timeout=5
Can anyone suggest possible corrections to allow perf graphing to again be made available to our customers?
Thanks
Graphing issue
-
kyang
Re: Graphing issue
Did you run through some of the steps in the troubleshooting article on performance graphs?
https://support.nagios.com/kb/article.php?id=9
Can you show us your NPCD logs also? (once you've increased the logging verbosity found in the article)
You could also send us the profile while you're at it, that way I'm not asking back and forth for information that can be found in there.
You can download it by going to Admin -> System Config -> System Profile -> Download Profile
(You can PM me the file, as it may be quite large and if you wanted to take out any sensitive information that's ok)
https://support.nagios.com/kb/article.php?id=9
Can you show us your NPCD logs also? (once you've increased the logging verbosity found in the article)
Code: Select all
tail -f /usr/local/nagios/var/npcd.logYou can download it by going to Admin -> System Config -> System Profile -> Download Profile
(You can PM me the file, as it may be quite large and if you wanted to take out any sensitive information that's ok)