Page 1 of 4

Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 22, 2013 10:07 am
by mrochelle
Nagios XI Version : 2012R1.6
Nagios Server 2.6.32-279.11.1.el6.x86_64 x86_64
CentOS release 6.3 (Final)
Gnome is not installed

I'm having a problem getting the graphs to display for a host network device that we only need to monitor once every 4 hours. From the advanced tab the performance data is captured.
4Hr_Host_Perf_tab.PNG
Since all my graphing for a number of other hosts and services work fine, the only difference is the check_interval is 240, I'm guessing I may have exceeded some default limit or need to modify some graph configuration parameter?
Thanks, Marcus

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 22, 2013 11:27 am
by abrist
What is the full command for your check? The performance data looks suspect . . .

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 22, 2013 11:36 am
by mrochelle
/usr/local/nagios/libexec/check_device.pl --device $HOSTALIAS$ --hostaddr xxx.xxx.xxx.xxx

The performance data is correct. 1 means OK the check is good. 0 of course means the check failed.

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 22, 2013 11:51 am
by sreinhardt
How long has this check been collecting data\running for? Its less about the check interval that you have and more that graph explorer requires a certain amount of data before it creates meaningful graphs. This is also the same with capacity planning. At once every 4 hours, it may take a few days before it has sufficient data to create a graph.

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 22, 2013 12:19 pm
by mrochelle
I believe you may have a point. The performance data was just turned on two days ago. I've noticed that some of the hosts where I have generated several immediate checks have started graphing. I will monitor over the next few days to check results.

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 22, 2013 12:30 pm
by abrist
Great. Let us know how the graphs turn out.

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 29, 2013 2:58 pm
by mrochelle
Well, it has been over a week now and we are not generating reports. I believe there is probably some tweeking required. Suggestions? :geek:

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 29, 2013 3:29 pm
by abrist
Could you post the output of (replace <hostname> with the hostname or ip of the host in question):

Code: Select all

cat /usr/local/nagios/var/perfdata.log | grep <hostname>
tail -25 /usr/local/nagios/var/npcd.log

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 29, 2013 3:31 pm
by scottwilkerson
also, can you verify some permissions

Code: Select all

ls -ld /usr/local/nagios/share/perfdata
And the following replacing <HOSTNAME> with this hosts name

Code: Select all

ls -l /usr/local/nagios/share/perfdata/<HOSTNAME>

Re: Problem Graphing Host with 4hr Check Interval

Posted: Mon Apr 29, 2013 3:49 pm
by mrochelle
# cat /usr/local/nagios/var/perfdata.log | grep ET_50063

# tail -25 /usr/local/nagios/var/npcd.log
[01-16-2013 16:40:48] NPCD: HINT: load_threshold is enabled - ('10.000000')
[01-16-2013 16:43:59] NPCD: Caught Termination Signal - Hasta la vista... baby
[01-17-2013 08:28:22] NPCD: npcd Daemon (0.4.14) started with PID=1632
[01-17-2013 08:28:22] NPCD: Please have a look at 'npcd -V' to get license information
[01-17-2013 08:28:22] NPCD: HINT: load_threshold is enabled - ('10.000000')
[01-17-2013 08:33:32] NPCD: Caught Termination Signal - Hasta la vista... baby
[01-31-2013 12:57:11] NPCD: npcd Daemon (0.4.14) started with PID=1646
[01-31-2013 12:57:11] NPCD: Please have a look at 'npcd -V' to get license information
[01-31-2013 12:57:11] NPCD: HINT: load_threshold is enabled - ('10.000000')
[01-31-2013 14:07:26] NPCD: Caught Termination Signal - Hasta la vista... baby
[01-31-2013 17:05:59] NPCD: npcd Daemon (0.4.14) started with PID=1611
[01-31-2013 17:05:59] NPCD: Please have a look at 'npcd -V' to get license information
[01-31-2013 17:05:59] NPCD: HINT: load_threshold is enabled - ('10.000000')
[01-31-2013 17:53:04] NPCD: Caught Termination Signal - Hasta la vista... baby
[02-14-2013 09:50:05] NPCD: npcd Daemon (0.4.14) started with PID=1644
[02-14-2013 09:50:05] NPCD: Please have a look at 'npcd -V' to get license information
[02-14-2013 09:50:05] NPCD: HINT: load_threshold is enabled - ('10.000000')
[02-14-2013 10:01:27] NPCD: Caught Termination Signal - Hasta la vista... baby
[03-12-2013 21:11:38] NPCD: npcd Daemon (0.4.14) started with PID=4930
[03-12-2013 21:11:38] NPCD: Please have a look at 'npcd -V' to get license information
[03-12-2013 21:11:38] NPCD: HINT: load_threshold is enabled - ('10.000000')
[03-31-2013 12:41:47] NPCD: Caught Termination Signal - Hasta la vista... baby
[03-31-2013 12:42:19] NPCD: npcd Daemon (0.4.14) started with PID=1924
[03-31-2013 12:42:19] NPCD: Please have a look at 'npcd -V' to get license information
[03-31-2013 12:42:19] NPCD: HINT: load_threshold is enabled - ('10.000000')

# ls -ld /usr/local/nagios/share/perfdata
drwxrwxr-x. 323 nagios nagios 12288 Apr 21 00:40 /usr/local/nagios/share/perfdata

# ls -l /usr/local/nagios/share/perfdata/ET_50063
total 380
-rwxrwxr-x 1 nagios nagios 384952 Apr 29 15:23 _HOST_.rrd
-rw-rw-rw- 1 nagios nagios 1890 Apr 29 15:23 _HOST_.xml