Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 10:36 am
by David.adder
ll /usr/local/nagios/var/spool/xdpe|wc -l
ls: cannot access /usr/local/nagios/var/spool/xdpe: No such file or directory
0
ll /usr/local/nagios/var
total 130220
drwxrwxr-x 2 nagios nagios 32768 ene 22 00:01 archives
-rw-r--r-- 1 apache apache 236947 jun 20 2012 graphapi.log
-rw-rw-r-- 1 nagios users 7787 ene 22 16:34 host-perfdata
-rw-rw-r-- 1 nagios users 419170 ene 22 16:34 nagios.debug
-rw-rw-r-- 1 nagios users 1000002 ene 22 16:34 nagios.debug.old
-rw-r--r-- 1 nagios nagios 6 ene 22 16:32 nagios.lock
-rw-r--r-- 1 nagios nagios 90616536 ene 22 16:34 nagios.log
-rw------- 1 nagios users 2842624 feb 20 2012 nagios.tmp4YAE9z
-rw-r--r-- 1 nagios nagios 5 nov 13 11:54 ndo2db.lock
-rw-rw-r-- 1 nagios nagios 0 ene 22 16:25 ndomod.tmp
srwxr-xr-x 1 nagios nagios 0 nov 13 11:54 ndo.sock
-rw-r--r-- 1 nagios nagios 8475719 ene 22 16:25 npcd.log
-rw-r--r-- 1 nagios nagios 10485821 ene 13 04:12 npcd.log.old
-rw-r--r-- 1 nagios nagios 3449179 ene 22 16:32 objects.cache
-rw-rw-rw- 1 nagios nagios 5371858 ene 22 16:25 perfdata.log
-rw------- 1 nagios users 5121518 ene 22 16:32 retention.dat
drwxrwsr-x 2 nagios nagcmd 4096 ene 22 16:32 rw
-rw-rw-r-- 1 nagios users 49415 ene 22 16:34 service-perfdata
drwxr-xr-x 5 nagios nagios 4096 ene 26 2011 spool
drwxr-xr-x 2 nagios nagios 4096 ene 22 16:34 stats
-rw-rw-r-- 1 nagios users 5114794 ene 22 16:34 status.dat
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 11:19 am
by scottwilkerson
Actually, there was a typo in mguthrie's first command. Please run:
Code:
ll /usr/local/nagios/var/spool/xidpe|wc -l
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 11:23 am
by David.adder
ll /usr/local/nagios/var/spool/xidpe|wc -l
1
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 11:27 am
by scottwilkerson
It appears to be keeping up.
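A note on reading that number: `ll` is usually an alias for `ls -l`, which prints a `total` line first, so `wc -l` reports one more than the number of entries, and a result of 1 means the directory is empty. A minimal sketch that counts only regular files, assuming a `find` that supports `-maxdepth` (GNU and BSD do); `count_spool` is a hypothetical helper name, and the demo runs against a throwaway temp directory rather than the real spool path:

```shell
#!/bin/sh
# count_spool DIR: print the number of regular files directly inside DIR.
# Unlike `ls -l DIR | wc -l`, this does not count the "total" header line.
count_spool() {
    find "$1" -maxdepth 1 -type f | wc -l | tr -d ' '
}

# Demo on a temporary directory (stand-in for /usr/local/nagios/var/spool/xidpe).
demo_dir=$(mktemp -d)
empty_count=$(count_spool "$demo_dir")
touch "$demo_dir/a.perfdata" "$demo_dir/b.perfdata"
two_count=$(count_spool "$demo_dir")
echo "empty: $empty_count, after two files: $two_count"
rm -rf "$demo_dir"
```

On the real system this would be called as `count_spool /usr/local/nagios/var/spool/xidpe`.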
Let's change the timeout in the following file
Code:
/usr/local/nagios/etc/pnp/process_perfdata.cfg
to
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 11:42 am
by David.adder
I don't know if this is normal, but the value changed after I changed the TIMEOUT:
ll /usr/local/nagios/var/spool/xidpe|wc -l
1
ll /usr/local/nagios/var/spool/xidpe|wc -l
3
I still get the same errors in "/usr/local/nagios/var/perfdata.log" and "/usr/local/nagios/var/npcd.log".
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 1:33 pm
by mguthrie
-rw-rw-r-- 1 nagios users 49415 ene 22 16:34 service-perfdata
For some reason the backup is happening at the file above. Run:
Code:
echo "" > /usr/local/nagios/var/service-perfdata
This clears the file and resets the queue. It probably backed up because of the load threshold.
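One detail worth knowing: `echo "" >` leaves a single newline in the file, so the file ends up 1 byte long rather than empty. If a truly empty file is wanted, a sketch along these lines truncates it to zero bytes. It uses a temp file as a stand-in for the real perfdata file, and `: >` is ordinary POSIX shell redirection:

```shell
#!/bin/sh
# Compare two ways of "clearing" a file, using a temp file as a stand-in
# for /usr/local/nagios/var/service-perfdata.
f=$(mktemp)
printf 'old perfdata line\n' > "$f"

echo "" > "$f"                       # leaves one newline character behind
size_echo=$(wc -c < "$f" | tr -d ' ')

: > "$f"                             # truncates to zero bytes
size_trunc=$(wc -c < "$f" | tr -d ' ')

echo "after echo: $size_echo byte(s), after ': >': $size_trunc byte(s)"
rm -f "$f"
```

Either form resets the file contents; the difference only matters if something downstream is sensitive to a blank first line.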
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Tue Jan 22, 2013 2:49 pm
by David.adder
I've run that command.
ll /usr/local/nagios/var
total 152636
drwxrwxr-x 2 nagios nagios 32768 ene 22 00:01 archives
-rw-r--r-- 1 apache apache 236947 jun 20 2012 graphapi.log
-rw-rw-r-- 1 nagios users 0 ene 22 20:44 host-perfdata
-rw-rw-r-- 1 nagios users 95820 ene 22 20:44 nagios.debug
-rw-rw-r-- 1 nagios users 1000028 ene 22 20:44 nagios.debug.old
-rw-r--r-- 1 nagios nagios 6 ene 22 18:04 nagios.lock
-rw-r--r-- 1 nagios nagios 113942569 ene 22 20:44 nagios.log
-rw------- 1 nagios users 2842624 feb 20 2012 nagios.tmp4YAE9z
-rw-r--r-- 1 nagios nagios 5 nov 13 11:54 ndo2db.lock
-rw-rw-r-- 1 nagios nagios 0 ene 22 18:04 ndomod.tmp
srwxr-xr-x 1 nagios nagios 0 nov 13 11:54 ndo.sock
-rw-r--r-- 1 nagios nagios 8482726 ene 22 20:35 npcd.log
-rw-r--r-- 1 nagios nagios 10485821 ene 13 04:12 npcd.log.old
-rw-r--r-- 1 nagios nagios 3441919 ene 22 18:04 objects.cache
-rw-rw-rw- 1 nagios nagios 5387015 ene 22 20:35 perfdata.log
-rw------- 1 nagios users 5126349 ene 22 20:04 retention.dat
drwxrwsr-x 2 nagios nagcmd 4096 ene 22 18:04 rw
-rw-rw-r-- 1 nagios users 0 ene 22 20:44 service-perfdata
drwxr-xr-x 5 nagios nagios 4096 ene 26 2011 spool
drwxr-xr-x 2 nagios nagios 4096 ene 22 20:44 stats
-rw-rw-r-- 1 nagios users 5110348 ene 22 20:44 status.dat
But I'm still getting the same errors:
tail /usr/local/nagios/var/perfdata.log
2013-01-22 20:29:00 [4613] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-22 20:29:00 [4613] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//service-perfdata.1358882905-PID-4613 deleted
2013-01-22 20:29:00 [4613] [0] *** Timeout while processing Host: "AMPEREBCM" Service: "my_c_drive"
2013-01-22 20:29:00 [4613] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-22 20:35:02 [2844] [0] *** TIMEOUT: Timeout after 15 secs. ***
2013-01-22 20:35:02 [2844] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-22 20:35:02 [2844] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-22 20:35:02 [2844] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//service-perfdata.1358883265-PID-2844 deleted
2013-01-22 20:35:02 [2844] [0] *** Timeout while processing Host: "NY-ARIES" Service: "my_c_drive_check"
2013-01-22 20:35:02 [2844] [0] *** process_perfdata.pl terminated on signal ALRM
tail /usr/local/nagios/var/npcd.log
[01-22-2013 20:26:39] NPCD: ERROR: Executed command exits with return code '7'
[01-22-2013 20:26:39] NPCD: ERROR: Executed command exits with return code '7'
[01-22-2013 20:26:39] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//service-perfdata.1358882770'
[01-22-2013 20:26:39] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//host-perfdata.1358882770'
[01-22-2013 20:29:00] NPCD: ERROR: Executed command exits with return code '7'
[01-22-2013 20:29:00] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//host-perfdata.1358882905'
[01-22-2013 20:29:00] NPCD: ERROR: Executed command exits with return code '7'
[01-22-2013 20:29:00] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//service-perfdata.1358882905'
[01-22-2013 20:35:02] NPCD: ERROR: Executed command exits with return code '7'
[01-22-2013 20:35:02] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//service-perfdata.1358883265'
ll /usr/local/nagios/var/spool/xidpe|wc -l
3
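To see whether the timeouts cluster on particular checks, the `Timeout while processing` lines in perfdata.log can be tallied per host and service. This is only a sketch: `summarize_timeouts` is a hypothetical helper name, the pattern assumes the log line format shown in the excerpt above, and the demo uses two lines copied from that excerpt instead of the real log:

```shell
#!/bin/sh
# summarize_timeouts FILE: count "Timeout while processing" entries per
# host/service pair in a perfdata.log-style file.
summarize_timeouts() {
    sed -n 's/.*Timeout while processing Host: "\([^"]*\)" Service: "\([^"]*\)".*/\1 \2/p' "$1" \
        | sort | uniq -c | sed 's/^ *//'
}

# Demo with two lines copied from the log excerpt above.
sample=$(mktemp)
cat > "$sample" <<'EOF'
2013-01-22 20:29:00 [4613] [0] *** Timeout while processing Host: "AMPEREBCM" Service: "my_c_drive"
2013-01-22 20:35:02 [2844] [0] *** Timeout while processing Host: "NY-ARIES" Service: "my_c_drive_check"
EOF
summary=$(summarize_timeouts "$sample")
echo "$summary"
rm -f "$sample"
```

Against the real file this would be `summarize_timeouts /usr/local/nagios/var/perfdata.log`, which prints one line per host/service pair with its timeout count.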
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Wed Jan 23, 2013 10:40 am
by mguthrie
Are you currently missing data from all graphs, or just the bandwidth graphs?
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Wed Jan 23, 2013 11:45 am
by David.adder
Actually, just the bandwidth graphs. The graphs are not uniform, as you can see in the picture I attached in the first post. That host is a firewall that has activity all the time, and the graphs don't show that.
Re: Problems with Host Performance Graphs and Bandwidth
Posted: Thu Jan 24, 2013 11:51 am
by scottwilkerson
One more command. Could you run:
Code:
ls -l /usr/local/nagios/var/spool/perfdata|wc -l