Page 1 of 1
some performance graphs seem stuck
Posted: Fri Jan 19, 2018 10:05 am
by Francesco
Hi all,
in our NagiosXI installation (v 5.4.11) we have some performance graphs that seem stuck.
After looking in /usr/local/nagios/share/perfdata/ folder we notice that stuck services files have different mtime:
xml file is updated, rdd file is not.
In attachment you can find an example of a stuck performance graph.
What can we do to fix this situation?
We would avoid that this will happen again, so what are the possible causes?
Thank you
check_graph.png
check_advanced.png
directory_screen.png
Re: some performance graphs seem stuck
Posted: Fri Jan 19, 2018 10:41 am
by npolovenko
Hello,
@Francesco. Seems like this graph has been stuck for quite a long time, so you can probably delete the old XML and RRD files(or move them to some backup directory). Next, please navigate to:
And run the following script:
After that run these commands:
Code: Select all
service npcd restart
service nagios restart
Please allow Nagios 10-15 minutes to recreate the RRD file and start graphing. If after 15 minutes you still don't see the graph please upload these files:
Code: Select all
/usr/local/nagios/var/perfdata.log
/usr/local/nagios/var/npcd.log
Thank you.
Re: some performance graphs seem stuck
Posted: Tue Jan 23, 2018 5:46 am
by Francesco
Hello
@npolovenko,
we moved stuck files, graph started again without running the fix permissions script.
We want to avoid that this issue will happen again, so we did some analysis.
Using the rrdtool dump option, we compared stuck .rrd and new .rrd file:
we found that before we had 13 gauges, now we have 10.
We had a similar issue before: we added one more performance metric and graph stopped working.
We need your support or a procedure to handle these situations:
what do we have to do when we will have less or more metrics in "performance data" to not lose previous data ?
Thank you for your support
Re: some performance graphs seem stuck
Posted: Tue Jan 23, 2018 10:51 am
by npolovenko
This article should help:
The problem also exists with any performance data where additional datasources are returned with the performance data, compared to when the original performance data was received for this service.
https://support.nagios.com/kb/article/n ... g-149.html
And here's the article describing how to delete a data source, just in case:
https://support.nagios.com/kb/article/n ... e-497.html