Hi all,
in our NagiosXI installation (v 5.4.11) we have some performance graphs that seem stuck.
After looking in /usr/local/nagios/share/perfdata/ folder we notice that stuck services files have different mtime:
xml file is updated, rdd file is not.
In attachment you can find an example of a stuck performance graph.
What can we do to fix this situation?
We would avoid that this will happen again, so what are the possible causes?
Thank you
some performance graphs seem stuck
some performance graphs seem stuck
You do not have the required permissions to view the files attached to this post.
-
npolovenko
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: some performance graphs seem stuck
Hello, @Francesco. Seems like this graph has been stuck for quite a long time, so you can probably delete the old XML and RRD files(or move them to some backup directory). Next, please navigate to:
And run the following script:
After that run these commands:
Please allow Nagios 10-15 minutes to recreate the RRD file and start graphing. If after 15 minutes you still don't see the graph please upload these files:
Thank you.
Code: Select all
/usr/local/nagiosxi/scripts/Code: Select all
./reset_config_perms.shCode: Select all
service npcd restart
service nagios restartCode: Select all
/usr/local/nagios/var/perfdata.log
/usr/local/nagios/var/npcd.logAs of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Re: some performance graphs seem stuck
Hello @npolovenko,
we moved stuck files, graph started again without running the fix permissions script.
We want to avoid that this issue will happen again, so we did some analysis.
Using the rrdtool dump option, we compared stuck .rrd and new .rrd file:
we found that before we had 13 gauges, now we have 10.
We had a similar issue before: we added one more performance metric and graph stopped working.
We need your support or a procedure to handle these situations:
what do we have to do when we will have less or more metrics in "performance data" to not lose previous data ?
Thank you for your support
we moved stuck files, graph started again without running the fix permissions script.
We want to avoid that this issue will happen again, so we did some analysis.
Using the rrdtool dump option, we compared stuck .rrd and new .rrd file:
we found that before we had 13 gauges, now we have 10.
We had a similar issue before: we added one more performance metric and graph stopped working.
We need your support or a procedure to handle these situations:
what do we have to do when we will have less or more metrics in "performance data" to not lose previous data ?
Thank you for your support
-
npolovenko
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: some performance graphs seem stuck
This article should help:
And here's the article describing how to delete a data source, just in case:
https://support.nagios.com/kb/article/n ... e-497.html
https://support.nagios.com/kb/article/n ... g-149.htmlThe problem also exists with any performance data where additional datasources are returned with the performance data, compared to when the original performance data was received for this service.
And here's the article describing how to delete a data source, just in case:
https://support.nagios.com/kb/article/n ... e-497.html
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.