Inaccurate Performance Graph

conston_rd
Posts: 14
Joined: Tue Nov 27, 2018 4:27 am

Inaccurate Performance Graph

Post by conston_rd »

Hi,

We are running Nagios XI 5.6.6 on CentOS 7.
The Core version is 4.4.3.

We are using the NCPA agent to collect metrics through active checks.


Today there was a server outage caused by a full filesystem. The server was down for more than 20 minutes.
I see "servicecheck timeout" errors in the Nagios event log.

Interestingly, the performance graph shows performance data even for the period when the host was down, which is not correct.
We would like to get this fixed. Could you please check and help us resolve this issue?

An rrd dump of the performance data shows data for the host-down period.
I have attached the RRD file and a nagios.log extract for the specific server,
as well as a screen grab of the performance graph.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Inaccurate Performance Graph

Post by benjaminsmith »

Hello,

Thank you for posting the screenshot and Nagios log; they were very helpful. While the server is down, the plugin generates no data, so RRDtool, which creates the graphs, plots a line from one data point to the next.

Code: Select all

[1587452400] CURRENT SERVICE STATE: p1l00462g;Root Volume;OK;HARD;1;OK: Used_percent was 78.00 %
[1587462217] SERVICE ALERT: p1l00462g;Root Volume;WARNING;SOFT;1;WARNING: Used_percent was 84.30 %
[1587462337] SERVICE ALERT: p1l00462g;Root Volume;WARNING;SOFT;2;WARNING: Used_percent was 84.30 %
[1587462395] SERVICE ALERT: p1l00462g;Root Volume;WARNING;HARD;3;WARNING: Used_percent was 84.30 %
[1587464056] SERVICE NOTIFICATION: constond;p1l00462g;Root Volume;WARNING;notify-service-by-email;WARNING: Used_percent was 84.30 %
[1587464056] SERVICE NOTIFICATION: servicenow_integration;p1l00462g;Root Volume;WARNING;notify_servicenow_service;WARNING: Used_percent was 84.30 %
[1587465950] SERVICE NOTIFICATION: constond;p1l00462g;Root Volume;OK;notify-service-by-email;OK: Used_percent was 63.90 %
[1587465950] SERVICE NOTIFICATION: servicenow_integration;p1l00462g;Root Volume;OK;notify_servicenow_service;OK: Used_percent was 63.90 %
[1587465950] SERVICE ALERT: p1l00462g;Root Volume;OK;HARD;3;OK: Used_percent was 63.90 %
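
For reference, the bracketed value at the start of each log line is a Unix epoch timestamp. The following is a minimal sketch, not part of Nagios XI, that converts those timestamps to readable UTC times and lists long gaps between consecutive log entries. The helper names (`parse_entries`, `find_gaps`, `to_utc`) and the one-hour threshold are assumptions made for illustration. Note that nagios.log records state changes and notifications rather than every check result, so a gap here is only a hint about where to look, not proof that no data was collected.

```python
import re
from datetime import datetime, timezone

# Matches the "[epoch] message" layout seen in the log excerpt above.
LOG_LINE = re.compile(r"^\[(\d+)\]\s+(.*)$")

def parse_entries(lines):
    """Return (epoch, message) tuples for lines that match the layout."""
    entries = []
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if match:
            entries.append((int(match.group(1)), match.group(2)))
    return entries

def find_gaps(entries, threshold):
    """Return (start, end, seconds) for consecutive entries more than
    `threshold` seconds apart. The threshold is a hypothetical tuning value."""
    gaps = []
    for (t0, _), (t1, _) in zip(entries, entries[1:]):
        if t1 - t0 > threshold:
            gaps.append((t0, t1, t1 - t0))
    return gaps

def to_utc(epoch):
    """Format an epoch timestamp as a UTC date and time string."""
    return datetime.fromtimestamp(epoch, tz=timezone.utc).strftime("%Y-%m-%d %H:%M:%S")

sample = [
    "[1587452400] CURRENT SERVICE STATE: p1l00462g;Root Volume;OK;HARD;1;OK: Used_percent was 78.00 %",
    "[1587462217] SERVICE ALERT: p1l00462g;Root Volume;WARNING;SOFT;1;WARNING: Used_percent was 84.30 %",
    "[1587462337] SERVICE ALERT: p1l00462g;Root Volume;WARNING;SOFT;2;WARNING: Used_percent was 84.30 %",
]

entries = parse_entries(sample)
for start, end, seconds in find_gaps(entries, threshold=3600):
    print(f"{to_utc(start)} -> {to_utc(end)} ({seconds} s without a log entry)")
```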
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
conston_rd
Posts: 14
Joined: Tue Nov 27, 2018 4:27 am

Re: Inaccurate Performance Graph

Post by conston_rd »

Thank you for the response.

You said that the RRD tool that creates the graphs is plotting the line from one data point to the next. But why do I see data in the RRD file? When no data is received from the plugin, the RRD file should not contain data for that period. Is my understanding correct?

Won't this result in inaccurate graph data?

Is there a way to identify the intervals when no data was collected?


Regards,
Conston
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Inaccurate Performance Graph

Post by benjaminsmith »

Hi Conston,

That is correct: RRDtool is filling in the missing data points in order to plot the performance graph. Currently there is no option to change this; however, we are planning to make significant improvements to the performance graphs in Nagios XI 6.

See: https://www.nagios.com/roadmaps/
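
To illustrate why values can appear in the RRD file for a period with no check results, here is a simplified model of how an RRDtool GAUGE data source treats the time between two updates. If the gap is no longer than the data source's heartbeat, the interval is filled from the newer update; if it is longer, the interval is stored as unknown. This is a sketch under stated assumptions, not RRDtool's actual code: the function name `fill_slots` and the step and heartbeat values are hypothetical, and real RRDtool also normalizes updates to step boundaries and consolidates them.

```python
def fill_slots(updates, step, heartbeat):
    """Simplified model of GAUGE handling between updates.

    updates   -- list of (timestamp, value), sorted, aligned to `step`
    step      -- seconds per stored slot (hypothetical value)
    heartbeat -- maximum seconds allowed between updates before the
                 interval is treated as unknown (hypothetical value)
    Returns a list of (slot_end_timestamp, value_or_None).
    """
    slots = []
    for (t_prev, _), (t_cur, v_cur) in zip(updates, updates[1:]):
        gap = t_cur - t_prev
        # Within the heartbeat, the newer value covers the whole interval;
        # beyond it, the interval is unknown (None here, NaN in an RRD).
        value = v_cur if gap <= heartbeat else None
        t = t_prev + step
        while t <= t_cur:
            slots.append((t, value))
            t += step
    return slots

# Three updates; the last one arrives after a 1200-second silence.
updates = [(0, 78.0), (300, 78.5), (1500, 84.3)]

filled = fill_slots(updates, step=300, heartbeat=8460)  # long heartbeat
strict = fill_slots(updates, step=300, heartbeat=600)   # short heartbeat

print(filled)
print(strict)
```

With the long heartbeat, every slot in the silent period carries a value, which matches the behavior described in this thread. With the short heartbeat, the same slots are unknown and would show as a break in the graph.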