Failed Graphins for individual checks

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
bomahony
Posts: 133
Joined: Wed Jul 04, 2018 10:46 am

Failed Graphins for individual checks

Post by bomahony »

Hey folks. I previously raised a ticket for failed perf-graphs:
https://support.nagios.com/forum/viewto ... 4&start=10

I am seeing the issue on and off. Seems the RRDs done update the odd time:
-rw-rw-r-- 1 nagios nagios 19548552 Mar 22 10:07 LTM___Pool___1.rrd
-rw-rw-r-- 1 nagios nagios 45330 Jul 23 11:59 LTM___Pool___1.xml

I ahd a look at https://support.nagios.com/kb/article.php?id=149 previously, but never went and did it. I don't really want to do the whole of the node, as there is 20G of perfdata across 32K files.

Is it possible to run the fix for a single rrd/xml perchance? I would rather not be constantly deleting the perfdata as we loose the history.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Failed Graphins for individual checks

Post by scottwilkerson »

You can do the same thing but for a single RRD, instead of the directory just use the full path to the rrd

e.g.

Code: Select all

./fix_ds_quantity.sh -i -d /usr/local/nagios/share/perfdata/YOURHOST/LTM___Pool___1.rrd
However, if you are seeing the issue on and off, this is likely not the problem unless you are changing the amount of datasources the plugin returns
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
bomahony
Posts: 133
Joined: Wed Jul 04, 2018 10:46 am

Re: Failed Graphins for individual checks

Post by bomahony »

TBH I havent done enough research into how often it is happening, due to workload.

I do plan to put a check in place to check for old RRD files that would help us, but i need to get time to do that :(
bomahony
Posts: 133
Joined: Wed Jul 04, 2018 10:46 am

Re: Failed Graphins for individual checks

Post by bomahony »

This didnt seem to fix my issue. I ran it for both the file, and for the folder, both with the same outcome.

```
Batch job confirmed by user.
Batch process started at Tue 23 Jul 18:08:00 BST 2019
Populating list of RRDs from the dircetory: /usr/local/nagios/share/perfdata/lb-001/LTM___Pool___1.rrd
Batch job finished at Tue 23 Jul 18:08:00 BST 2019.
A total of 0 file(s) were updated with a total of 0 datasource(s).
Changes logged to the file /tmp/fix_rrd_ds.log
```
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Failed Graphins for individual checks

Post by scottwilkerson »

bomahony wrote:This didnt seem to fix my issue. I ran it for both the file, and for the folder, both with the same outcome.
I had thought this might not be the problem
scottwilkerson wrote:However, if you are seeing the issue on and off, this is likely not the problem unless you are changing the amount of datasources the plugin returns
Where are you seeing a problem? You didn't define you actual problem, just made reference to another post.
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
bomahony
Posts: 133
Joined: Wed Jul 04, 2018 10:46 am

Re: Failed Graphins for individual checks

Post by bomahony »

Hi scott. Due to time pressure i just deleted the old data. I will spend some time in the upcoming weeks and try to track down the issue.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Failed Graphins for individual checks

Post by scottwilkerson »

bomahony wrote:Hi scott. Due to time pressure i just deleted the old data. I will spend some time in the upcoming weeks and try to track down the issue.
Sounds good.
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
bomahony
Posts: 133
Joined: Wed Jul 04, 2018 10:46 am

Re: Failed Graphins for individual checks

Post by bomahony »

You can close this for now as i am unusre when i will have the proper time to investigate
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Failed Graphins for individual checks

Post by scottwilkerson »

bomahony wrote:You can close this for now as i am unusre when i will have the proper time to investigate
Great!

Locking
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
Locked