Page 1 of 2

Some performance graphs not working after upgrade

Posted: Wed May 21, 2014 8:34 am
by btemple
After upgrading our Nagios XI servers to 2014 some of our performance graphs are no longer updating. The check did not change and was working before upgrade. Also can this check be run in Fahrenheit instead of Celsius ?

check_xi_service_snmp
check_snmp -H XXXXXX -o 1.3.6.1.4.1.2636.3.1.13.1.7.9.1.0.0 -C XXXXXX -P 2c -l "" -u "Deg C" -w 100 -c 120

Re: Some performance graphs not working after upgrade

Posted: Wed May 21, 2014 1:09 pm
by slansing
There does not seem to be anything inherently wrong with the performance data being returned, are your other hosts/services experiencing performance data issues as well? Or is this isolated only to this service? What is the output of:

Code: Select all

tail -50 /usr/local/nagios/var/npcd.log
tail -50 /usr/local/nagios/var/perfdata.log

Re: Some performance graphs not working after upgrade

Posted: Wed May 21, 2014 1:44 pm
by btemple
It only seems to effect that check.

Logs requested are attached

Re: Some performance graphs not working after upgrade

Posted: Wed May 21, 2014 4:39 pm
by scottwilkerson
For that particular service can you go to the advanced tab of the Service Status Detail page and show what is reporting in the "Performance Data:" field.

Thanks

Re: Some performance graphs not working after upgrade

Posted: Thu May 22, 2014 6:24 am
by btemple
Scott that info is in my first attachment I think ?

Re: Some performance graphs not working after upgrade

Posted: Thu May 22, 2014 9:36 am
by abrist
You are hitting both "MAX load" and "TIMEOUT". Do the following:
Edit:

Code: Select all

/usr/local/nagios/etc/pnp/process_perfdata.cfg
Change:

Code: Select all

TIMEOUT = 5
To:

Code: Select all

TIMEOUT = 20
Also edit this file:

Code: Select all

/usr/local/nagios/etc/pnp/npcd.cfg
Change:

Code: Select all

load_threshold = 10.0
To:

Code: Select all

load_threshold = 30.0
Now restart npcd:

Code: Select all

service npcd stop
killall -9 npcd
service npcd start

Re: Some performance graphs not working after upgrade

Posted: Thu May 22, 2014 12:03 pm
by btemple
thanks I will give this a shot

Re: Some performance graphs not working after upgrade

Posted: Thu May 22, 2014 2:09 pm
by tmcdonald
We'll keep this open for you

Re: Some performance graphs not working after upgrade

Posted: Thu May 22, 2014 6:02 pm
by Box293
I think the Unit Of Measurement contains a space (which is not allowed).

Your screenshot has

Code: Select all

=38Deg C;100;120
Nagios Plugin Development Guidelines states the following:

Code: Select all

This is the expected format:
'label'=value[UOM];[warn];[crit];[min];[max]
You can see that value and [UOM] do not have a space between them.

I beleive your command should be something like:

Code: Select all

check_snmp -H XXXXXX -o 1.3.6.1.4.1.2636.3.1.13.1.7.9.1.0.0 -C XXXXXX -P 2c -l "" -u "Deg_C" -w 100 -c 120

Re: Some performance graphs not working after upgrade

Posted: Fri May 30, 2014 9:49 am
by btemple
Ok I made the change and am running it against a router but no performance graph was created or updated. Would permissions on the directory have changed during upgrade ?