Page 1 of 2
Performance Graphs not Populating (not all servers)
Posted: Thu Apr 04, 2019 11:09 am
by blariv
Hi,
we are having trouble with some of our centos servers not showing perf graphs. all windows and most other linux systems are fine.
performance data is being shown in nagios, just no graphs. it is on all services
see attached screenshots and profile.
thank you.
Re: Performance Graphs not Populating (not all servers)
Posted: Thu Apr 04, 2019 12:20 pm
by npolovenko
Hello,
@blariv. The problem with passive checks and RRD is that if passive results don't come soon enough, the RRD tool will ignore them. The default heartbeat is 8640 seconds, meaning the data isn't inserted if there is an interval greater than 2.4 hours between points. You can verify the heartbeat settings on the databases by running:
rdtool dump /usr/local/nagios/share/perfdata/<hostname>/<servicedesc>.rrd | grep -i heart
Re: Performance Graphs not Populating (not all servers)
Posted: Thu Apr 04, 2019 12:48 pm
by blariv
<minimal_heartbeat>8460</minimal_heartbeat>
Re: Performance Graphs not Populating (not all servers)
Posted: Thu Apr 04, 2019 4:53 pm
by npolovenko
@ blariv, How often does this service check get updated with the passive result? Is the interval larger than 2 hours or smaller?
Re: Performance Graphs not Populating (not all servers)
Posted: Mon Apr 08, 2019 7:03 am
by blariv
just wanted to provide an update. its seems that when sending the passive check directly from the server we are seeing this error, therefore nothing is actually getting back to the XI server.
ERROR: The NRDP Server said
Use of uninitialized value $status in string eq at /usr/local/nrdp/clients/nrds/nrds_updater.pl line 161, <CONFIG
ERROR: Could not connect to
http://nagiosxi.xxx.com/nrdp/. Check your cfg file. at /usr/local/nrdp/clients/nrdp
Re: Performance Graphs not Populating (not all servers)
Posted: Mon Apr 08, 2019 1:06 pm
by npolovenko
@blariv, Please post all files from the following folder:
/usr/local/nrdp/clients/nrds
Also, from the client-server, please run the nmap command against the Nagios XI server and post the result.
Re: Performance Graphs not Populating (not all servers)
Posted: Wed Apr 10, 2019 10:00 am
by blariv
I seem to have fixed most of it.
looks like my nrds config on the hosts we pointing to http instead of https.
However of the 4 checks we use 3 are writing perf data.
command[Check Load]=/usr/local/nagios/libexec/check_load -w 30,25,20 -c 50,40,35
command[Check Disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -X nfs -X cifs
command[Check Swap]=/usr/local/nagios/libexec/check_swap -w 20% -c 10%
command[Check Zombie Procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z
Check Disk is not.
I grabbed the info from this command.
rrdtool dump /usr/local/nagios/share/perfdata/hostname.xxx.com/Check_Disk.rrd
This shows nothing written since 2018-03-15 14:00:00 EDT the rest are showing 2019-04-10 08:00:00
Re: Performance Graphs not Populating (not all servers)
Posted: Wed Apr 10, 2019 12:29 pm
by blariv
I noticed this in the /usr/local/nagios/share/perfdata/<hostname>/<servicedesc>.rrd file for this host
<RRD>
<RC>1</RC>
<TXT>/usr/local/nagios/share/perfdata/hostname.xxx.com/Check_Disk.rrd: found extra data on update argument: 0:125</TXT>
</RRD>
Re: Performance Graphs not Populating (not all servers)
Posted: Wed Apr 10, 2019 12:30 pm
by npolovenko
@blariv, Can you move the Check_Disk.rrd and Check_Disk.xml files to a backup location and then remove them from this folder:
/usr/local/nagios/share/perfdata/
After that wait 15 minutes and check again to see if the graph started working.
Re: Performance Graphs not Populating (not all servers)
Posted: Wed Apr 10, 2019 4:05 pm
by blariv
that did it.
is there an easy way to find that string in any .xml file so i can just recreate those?