Page 2 of 3
Re: No Metrics or Performance Graphs
Posted: Thu Feb 14, 2013 10:15 am
by slansing
It looks like the load issue was fixed, what is the output of:
Code: Select all
tail -50 /usr/local/nagios/var/perfdata.log
Re: No Metrics or Performance Graphs
Posted: Thu Feb 14, 2013 11:46 am
by RB1982
Heres the output.
I still dont have any graphs or metrics
Code: Select all
2013-01-21 11:11:53 [29452] [0] *** Timeout while processing Host: "resdc01.ds-s.com" Service: "_HOST_"
2013-01-21 11:11:53 [29452] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-21 11:12:20 [29491] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-21 11:12:38 [29491] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-21 11:12:38 [29491] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-21 11:12:38 [29491] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1358764178.perfdata.host-PID-29491 deleted
2013-01-21 11:12:38 [29495] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-21 11:12:38 [29491] [0] *** Timeout while processing Host: "dervhost01.ds-s.com" Service: "_HOST_"
2013-01-21 11:12:38 [29495] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-21 11:12:38 [29491] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-21 11:12:38 [29495] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-21 11:12:38 [29495] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1358764163.perfdata.service-PID-29495 deleted
2013-01-21 11:12:38 [29495] [0] *** Timeout while processing Host: "" Service: ""
2013-01-21 11:12:38 [29495] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-21 11:12:38 [29492] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-21 11:12:43 [29492] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-21 11:12:44 [29492] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-21 11:12:44 [29493] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-21 11:12:44 [29493] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-21 11:12:44 [29493] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-21 11:12:44 [29496] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-21 11:12:44 [29496] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-21 11:12:44 [29496] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-21 11:12:44 [29492] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1358764193.perfdata.service-PID-29492 deleted
2013-01-21 11:12:44 [29493] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1358764178.perfdata.service-PID-29493 deleted
2013-01-21 11:12:44 [29492] [0] *** Timeout while processing Host: "" Service: ""
2013-01-21 11:12:44 [29492] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-21 11:12:44 [29493] [0] *** Timeout while processing Host: "" Service: ""
2013-01-21 11:12:44 [29493] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-21 11:12:44 [29496] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1358764193.perfdata.host-PID-29496 deleted
2013-01-21 11:12:49 [29496] [0] *** Timeout while processing Host: "" Service: ""
2013-01-21 11:13:05 [29496] [0] *** process_perfdata.pl terminated on signal ALRM
2013-01-21 11:21:56 [4037] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-01-21 11:21:56 [4037] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-01-21 11:21:56 [4037] [0] *** TIMEOUT: Please check your npcd.cfg
2013-01-21 11:21:56 [4037] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1358764268.perfdata.host-PID-4037 deleted
2013-01-21 11:21:56 [4037] [0] *** Timeout while processing Host: "y.y.com.com" Service: "_HOST_"
2013-01-21 11:21:56 [4037] [0] *** process_perfdata.pl terminated on signal ALRM
2013-02-09 01:04:38 [30423] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-02-09 01:04:38 [30423] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-02-09 01:04:38 [30423] [0] *** TIMEOUT: Please check your npcd.cfg
2013-02-09 01:04:38 [30423] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1360371850.perfdata.host-PID-30423 deleted
2013-02-09 01:04:38 [30423] [0] *** Timeout while processing Host: "y.y.com" Service: "_HOST_"
2013-02-09 01:04:38 [30423] [0] *** process_perfdata.pl terminated on signal ALRM
2013-02-09 01:25:46 [425] [0] *** TIMEOUT: Timeout after 5 secs. ***
2013-02-09 01:25:46 [425] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2013-02-09 01:25:46 [425] [0] *** TIMEOUT: Please check your npcd.cfg
2013-02-09 01:25:46 [425] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1360373130.perfdata.host-PID-425 deleted
2013-02-09 01:25:46 [425] [0] *** Timeout while processing Host: "y.y.com" Service: "_HOST_"
2013-02-09 01:25:46 [425] [0] *** process_perfdata.pl terminated on signal ALRM
Re: No Metrics or Performance Graphs
Posted: Thu Feb 14, 2013 11:57 am
by abrist
You are still experiencing timeouts for perfdata processing. Did you change the TIMEOUT setting as instructed by slansing? (you will need to restart npcd after the change)
Re: No Metrics or Performance Graphs
Posted: Fri Feb 15, 2013 5:10 am
by RB1982
I have changed the value as perviously instructed.
I have restarted the service, I even rebooted the box to be sure.
No change.
Re: No Metrics or Performance Graphs
Posted: Fri Feb 15, 2013 11:07 am
by abrist
Could you post the output of the following command in a code wrap?
Code: Select all
cat /usr/local/nagios/etc/pnp/process_perfdata.cfg | grep -5 TIMEOUT
Re: No Metrics or Performance Graphs
Posted: Mon Feb 18, 2013 4:52 am
by RB1982
Code: Select all
#
# $Id: process_perfdata.cfg-sample.in 520 2008-09-16 12:50:10Z pitchfork $
#
# process_perfdata.pl Timout
#
TIMEOUT = 15
#
# Use RRDs Perl Module
#
USE_RRDs = 1
#
Re: No Metrics or Performance Graphs
Posted: Mon Feb 18, 2013 8:31 am
by scottwilkerson
Thanks, please also post you /usr/local/nagios/etc/pnp/npcd.cfg
As well as the output of the following
Re: No Metrics or Performance Graphs
Posted: Mon Feb 18, 2013 10:48 am
by RB1982
Code: Select all
/usr/local/nagios/etc/pnp/npcd.cfg: line 20: user: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 21: group: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 32: log_type: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 42: log_file: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 51: max_logfile_size: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 64: log_level: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 75: perfdata_spool_dir: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 88: perfdata_file_run_cmd: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 101: perfdata_file_run_cmd_args: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 107: npcd_max_threads: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 113: sleep_time: command not found
/usr/local/nagios/etc/pnp/npcd.cfg: line 136: load_threshold: command not found
Code: Select all
/tmp/nagiosxi/subcomponents/pnp/mods/cfg/npcd.cfg
/usr/local/nagios/etc/pnp/npcd.cfg-sample
/usr/local/nagios/etc/pnp/npcd.cfg
Re: No Metrics or Performance Graphs
Posted: Mon Feb 18, 2013 11:19 am
by scottwilkerson
please also post your /usr/local/nagios/etc/pnp/npcd.cfg
Re: No Metrics or Performance Graphs
Posted: Tue Feb 19, 2013 4:12 am
by RB1982
Does anyone have any thoughts here or should i just reopen my previous support ticket?
I did have this in an actual support ticket, but i dont seem to be getting any response to my email for it.
# NPCD.cfg - sample configuration file for PNPs NPCD
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation; either version 2 of the License, or
# (at your option) any later version.
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of the GNU General Public License version 2 as
# published by the Free Software Foundation;
#
# You should have received a copy of the GNU General Public License
# along with this program; if not, write to the Free Software
# Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
#
# $Id: npcd.cfg-sample.in 440 2008-04-24 09:08:20Z hendrikb $
# Privilege Options
user = nagios
group = nagios
# Logging Options
# log_type - define if you want your logs to
# 'syslog' or to a 'file'
#
# log_type = <value>
#
log_type = file
#log_type = syslog
# log_file - define a path to your logfile
# needed if 'log_type'='file'
#
# log_file = </path/to/logpath/filename>
#
log_file = /usr/local/nagios/var/npcd.log
# max_logfile_size - defines the maximum filesize (bytes)
# before the logfile will rotated.
#
# max_logfile_size = <value> (default 10Mbyte)
#
max_logfile_size = 10485760
# log_level - how much should we log?
#
# log_level = <integer value>
#
# 0 = No logging - except errors
# 1 = Small logging - some few more output
# 2 = More Logging (actual ALL logs)
# -1 = DEBUG Mode - ALL Logging and slower processing
#
log_level = 2
# NEEDED OPTIONS
#
# perfdata_spool_dir - where we can find the
# performance data files
#
# perfdata_spool_dir = </path/to/directory/>
#
perfdata_spool_dir = /usr/local/nagios/var/spool/perfdata/
# Execute following command for each found file
# in 'perfdata_spool_dir'
#
# perfdata_file_run_cmd = </path/to/command>
#
# Must be executable by user/group from above
#
# perfdata_file_run_cmd = </path/to/filename>
#
perfdata_file_run_cmd = /usr/local/nagios/libexec/process_perfdata.pl
# perfdata_file_run_cmd_args (optional)
#
# If you wish, you can apply more arguments to the
# perfdata_file_run_cmd
#
# Hint:
# NPCD will create a command line like this:
# '<perfdata_file_runc_cmd> <perfdata_file_runc_cmd_args> <filename_from_spool_dir>'
#
perfdata_file_run_cmd_args = -b
# npcd_max_threads - define how many parallel threads we
# should start
npcd_max_threads = 5
# sleep_time - how many seconds should npcd wait between dirscans
#
# sleep_time = 15 (default)
sleep_time = 15
# EXPERIMENTAL
#
# use_load_threshold - enables/disables load watching
#
# use_load_threshold = <0 / 1> (default: 0)
#
#use_load_threshold = 0
# EXPERIMENTAL
#
# load_threshold - npcd won't start new threads
# if your system load is over this threshold
#
# load_threshold = <float value> (default: 10.0)
#
# Hint: Do not use "," as decimal delimeter
#
load_threshold = 15.0
# location of your pid file
#pid_file=/var/run/npcd.pid
pid_file=/usr/local/nagiosxi/var/subsys/npcd.pid
# We have to end with a newline