some performance graphs seem stuck

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
User avatar
Francesco
Posts: 124
Joined: Wed Oct 28, 2015 5:02 am
Location: /usr
Contact:

some performance graphs seem stuck

Post by Francesco »

Hi all,
in our NagiosXI installation (v 5.4.11) we have some performance graphs that seem stuck.
After looking in /usr/local/nagios/share/perfdata/ folder we notice that stuck services files have different mtime:
xml file is updated, rdd file is not.

In attachment you can find an example of a stuck performance graph.

What can we do to fix this situation?
We would avoid that this will happen again, so what are the possible causes?

Thank you
check_graph.png
check_advanced.png
directory_screen.png
You do not have the required permissions to view the files attached to this post.
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: some performance graphs seem stuck

Post by npolovenko »

Hello, @Francesco. Seems like this graph has been stuck for quite a long time, so you can probably delete the old XML and RRD files(or move them to some backup directory). Next, please navigate to:

Code: Select all

/usr/local/nagiosxi/scripts/
And run the following script:

Code: Select all

./reset_config_perms.sh
After that run these commands:

Code: Select all

service npcd restart
service nagios restart
Please allow Nagios 10-15 minutes to recreate the RRD file and start graphing. If after 15 minutes you still don't see the graph please upload these files:

Code: Select all

/usr/local/nagios/var/perfdata.log
/usr/local/nagios/var/npcd.log
Thank you.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
User avatar
Francesco
Posts: 124
Joined: Wed Oct 28, 2015 5:02 am
Location: /usr
Contact:

Re: some performance graphs seem stuck

Post by Francesco »

Hello @npolovenko,
we moved stuck files, graph started again without running the fix permissions script.

We want to avoid that this issue will happen again, so we did some analysis.
Using the rrdtool dump option, we compared stuck .rrd and new .rrd file:
we found that before we had 13 gauges, now we have 10.

We had a similar issue before: we added one more performance metric and graph stopped working.

We need your support or a procedure to handle these situations:
what do we have to do when we will have less or more metrics in "performance data" to not lose previous data ?

Thank you for your support
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: some performance graphs seem stuck

Post by npolovenko »

This article should help:
The problem also exists with any performance data where additional datasources are returned with the performance data, compared to when the original performance data was received for this service.
https://support.nagios.com/kb/article/n ... g-149.html

And here's the article describing how to delete a data source, just in case:
https://support.nagios.com/kb/article/n ... e-497.html
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Locked