Re: rrd does not exist

Posted: Mon Feb 11, 2013 11:55 am
by slansing
Hello Raj,

Have you taken a look at this thread? You and David.adder had the same issue with RRDs, and he then had a problem with graphs not showing up, so I thought the two cases might be similar:

http://support.nagios.com/forum/viewtop ... f=6&t=9254

Re: rrd does not exist

Posted: Mon Feb 11, 2013 10:08 pm
by gm_rajkumar
Hi,

I followed the steps in http://assets.nagios.com/downloads/nagi ... ios_XI.pdf and that resolved the "rrd does not exist" issue. However, I still can't see the performance graphs for routers, switches and other network devices. The Windows and Linux machines are graphing properly.
I ran the command below and found that the log was last updated on 8 Jan 2013. Is there a problem with fetching data? I'm a bit confused.

tail -50 /usr/local/nagios/var/perfdata.log
2013-01-08 13:47:31 [2823] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-08 13:49:27 [12211] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-08 13:49:27 [12211] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-08 13:49:27 [12211] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-08 13:49:27 [12211] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1357638546.perfdata.service-PID-12211 deleted
2013-01-08 13:49:27 [12211] [0] *** Timeout while processing Host: "192.168.104.2" Service: "users_11f_Bandwidth"
2013-01-08 13:49:27 [12211] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-08 13:50:53 [16408] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-08 13:50:53 [16408] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-08 13:50:53 [16408] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-08 13:50:53 [16408] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1357638636.perfdata.service-PID-16408 deleted
2013-01-08 13:50:53 [16408] [0] *** Timeout while processing Host: "192.168.104.2" Service: "netapps-cfs1-4_Bandwidth"
2013-01-08 13:50:53 [16408] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1357638681.perfdata.service-PID-19935 deleted
2013-01-08 13:51:30 [19935] [0] *** Timeout while processing Host: "192.168.104.2" Service: "mgmt_switches_12f_Bandwidth"
2013-01-08 13:51:30 [19935] [0] *** process_perfdata.pl terminated on signal ALRM

Let me know if you need anything further.
Thanks!
Regards,
Raj.

Re: rrd does not exist

Posted: Tue Feb 12, 2013 12:32 pm
by slansing
Can you post your process_perfdata.cfg file?

/usr/local/nagios/etc/pnp/process_perfdata.cfg

Also, for good measure, run the following:

service npcd stop

killall -9 npcd

service npcd start

Re: rrd does not exist

Posted: Tue Feb 12, 2013 11:08 pm
by gm_rajkumar
Hi,

Please find below the process_perfdata.cfg file, followed by the npcd.log and perfdata.log output, for your reference.

#
# Config File for process_perfdata.pl
#
# $Id: process_perfdata.cfg-sample.in 520 2008-09-16 12:50:10Z pitchfork $
#
# process_perfdata.pl Timout
#
TIMEOUT = 5
#
# Use RRDs Perl Module
#
USE_RRDs = 1
#
#
#
RRDPATH = /usr/local/nagios/share/perfdata
#
#
#
RRDTOOL = /usr/bin/rrdtool
#
#
#
CFG_DIR = /usr/local/nagios/etc/pnp
#
#
#
RRD_HEARTBEAT = 8460
#
#
#
RRA_CFG = /usr/local/nagios/etc/pnp/rra.cfg
#
#
#
RRA_STEP = 60
#
#
#
LOG_FILE = /usr/local/nagios/var/perfdata.log
#
# Loglevel 0=silent 1=normal 2=debug
#
LOG_LEVEL = 0
#
# XML encoding
# The supported encodings are ISO-8859-1, UTF-8 and US-ASCII.
# http://www.php.net/xml-parser-create
XML_ENC = UTF-8
#
# EXPERIMENTAL rrdcached Support
# Use only with rrdtool svn revision 1511+
#
# RRD_DAEMON_OPTS = unix:/tmp/rrdcached.sock

After performing the steps below, the problem still persists.
service npcd stop

killall -9 npcd

service npcd start

[root@monitoring etc]# tail /usr/local/nagios/var/npcd.log
[02-12-2013 22:28:58] NPCD: Error while getting file status[02-12-2013 22:59:27] NPCD: Error while getting file status[02-12-2013 23:34:12] NPCD: Error while getting file status[02-12-2013 23:55:37] NPCD: Error while getting file status[02-13-2013 00:05:41] NPCD: Error while getting file status[02-13-2013 01:01:22] NPCD: Error while getting file status[02-13-2013 01:11:56] NPCD: ERROR: Executed command exits with return code '4'
[02-13-2013 01:11:56] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1360703508.perfdata.service'
[02-13-2013 02:18:40] NPCD: Error while getting file status[02-13-2013 02:54:56] NPCD: Error while getting file status[02-13-2013 03:03:28] NPCD: ERROR: Executed command exits with return code '4'
[02-13-2013 03:03:28] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1360710198.perfdata.host'
[02-13-2013 03:55:50] NPCD: Error while getting file status[02-13-2013 05:15:45] NPCD: ERROR: Executed command exits with return code '4'
[02-13-2013 05:15:45] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1360718133.perfdata.service'
[02-13-2013 05:39:57] NPCD: ERROR: Executed command exits with return code '4'
[02-13-2013 05:39:57] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1360719588.perfdata.service'
[02-13-2013 06:12:58] NPCD: Error while getting file status[02-13-2013 07:03:21] NPCD: Error while getting file status[02-13-2013 07:55:43] NPCD: ERROR: Executed command exits with return code '4'
[02-13-2013 07:55:43] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1360727718.perfdata.host'
You have new mail in /var/spool/mail/root

[root@monitoring etc]# tail /usr/local/nagios/var/perfdata.log
2013-01-08 13:50:53 [16408] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-08 13:50:53 [16408] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1357638636.perfdata.service-PID-16408 deleted
2013-01-08 13:50:53 [16408] [0] *** Timeout while processing Host: "192.168.104.2" Service: "netapps-cfs1-4_Bandwidth"
2013-01-08 13:50:53 [16408] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-08 13:51:30 [19935] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1357638681.perfdata.service-PID-19935 deleted
2013-01-08 13:51:30 [19935] [0] *** Timeout while processing Host: "192.168.104.2" Service: "mgmt_switches_12f_Bandwidth"
2013-01-08 13:51:30 [19935] [0] *** process_perfdata.pl terminated on signal ALRM
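
A side note for anyone reading these logs: the spool file names (for example 1360703508.perfdata.service) appear to start with a Unix epoch timestamp, which would be the case if the perfdata files are moved into the spool using the Nagios $TIMET$ macro. Assuming that, the rough sketch below decodes the leading number so it can be compared with the system clock and with the timestamps in the logs. The function name spool_epoch_to_date is made up for illustration, and `date -d @...` assumes GNU date.

```shell
#!/bin/sh
# Sketch only: decode the epoch prefix of an NPCD spool file name.
# Assumes names look like <epoch>.perfdata.service or <epoch>.perfdata.host.
spool_epoch_to_date() {
    name=${1##*/}        # drop the directory part
    epoch=${name%%.*}    # keep everything before the first dot
    # GNU date: "@N" means N seconds since 1970-01-01 00:00:00 UTC
    date -u -d "@$epoch" '+%Y-%m-%d %H:%M:%S UTC'
}

# Path copied from the npcd.log output above
spool_epoch_to_date /usr/local/nagios/var/spool/perfdata//1360703508.perfdata.service
```

If the decoded time is far away from what `date` reports on the server, the clock is worth a closer look.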

Re: rrd does not exist

Posted: Wed Feb 13, 2013 3:36 pm
by scottwilkerson
Please run the following and report back.

df -i
df -h
ls /usr/local/nagios/var/spool/perfdata | wc -l
ls /usr/local/nagios/var/spool/xidpe | wc -l

Also, let's temporarily turn on the debug log level in /usr/local/nagios/etc/pnp/process_perfdata.cfg

LOG_LEVEL = 2

Then wait a minute and run

tail -n 80 /usr/local/nagios/var/perfdata.log

Re: rrd does not exist

Posted: Fri Feb 15, 2013 1:14 am
by gm_rajkumar
After a long struggle, we have finally found the solution for this. The steps to clear it are below.

1. The root cause appears to be the system time. I don't know how the time got changed without it being noticed. I have fixed the time sync.
2. Look at the timestamps of the files in /usr/local/nagios/share/perfdata/, which is where the rrd files are stored and read. In my case they all had an incorrect date/time, which is why the performance graphs were not displayed properly.
3. I manually removed the rrd files that had an incorrect date/time. After that, new files were created with current timestamps and I can now see the performance graphs. That's great! I hope this gives you some ideas.

Regards,
Raj.

Re: rrd does not exist

Posted: Fri Feb 15, 2013 10:34 am
by slansing
This option was mentioned earlier in this thread:
How important is the historical information on your installation? I ask because by far the easiest way to remedy the situation would be to delete the old .rrd files and let new ones be created.
Glad to hear it all worked out for you, though. It seems your case and David.Adder's both had the same root cause: a change in time, as if the system clock had been set in the future and then yanked back to the current time.
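
For anyone who lands here with the same symptom, here is a rough sketch of how the rrd files with a future timestamp could be listed before deciding what to delete. It only looks at file modification times, which is what was checked in this thread. The function name list_future_rrds is made up for illustration, and the directory is the RRDPATH from the process_perfdata.cfg posted earlier, so adjust it if yours differs.

```shell
#!/bin/sh
# Sketch only: list .rrd files whose modification time is later than "now".
# It prints file names and deletes nothing.
RRD_DIR="${RRD_DIR:-/usr/local/nagios/share/perfdata}"

list_future_rrds() {
    # A freshly created temp file has an mtime of "now" and serves as the reference
    ref=$(mktemp) || return 1
    # -newer matches files modified more recently than the reference file
    find "$1" -type f -name '*.rrd' -newer "$ref"
    rm -f "$ref"
}

if [ -d "$RRD_DIR" ]; then
    list_future_rrds "$RRD_DIR"
fi
```

Fix the clock first (NTP or similar), then review the list. Deleting an .rrd file throws away its history, so copy the files somewhere safe if that history matters. Running `rrdtool last` on a file prints the last update time stored inside the RRD, which may be a more direct check than the file timestamp.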