Hey folks. I previously raised a ticket for failed perf-graphs:
https://support.nagios.com/forum/viewto ... 4&start=10
I am seeing the issue on and off. Seems the RRDs done update the odd time:
-rw-rw-r-- 1 nagios nagios 19548552 Mar 22 10:07 LTM___Pool___1.rrd
-rw-rw-r-- 1 nagios nagios 45330 Jul 23 11:59 LTM___Pool___1.xml
I ahd a look at https://support.nagios.com/kb/article.php?id=149 previously, but never went and did it. I don't really want to do the whole of the node, as there is 20G of perfdata across 32K files.
Is it possible to run the fix for a single rrd/xml perchance? I would rather not be constantly deleting the perfdata as we loose the history.
Failed Graphins for individual checks
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Failed Graphins for individual checks
You can do the same thing but for a single RRD, instead of the directory just use the full path to the rrd
e.g.
However, if you are seeing the issue on and off, this is likely not the problem unless you are changing the amount of datasources the plugin returns
e.g.
Code: Select all
./fix_ds_quantity.sh -i -d /usr/local/nagios/share/perfdata/YOURHOST/LTM___Pool___1.rrd
Re: Failed Graphins for individual checks
TBH I havent done enough research into how often it is happening, due to workload.
I do plan to put a check in place to check for old RRD files that would help us, but i need to get time to do that
I do plan to put a check in place to check for old RRD files that would help us, but i need to get time to do that
Re: Failed Graphins for individual checks
This didnt seem to fix my issue. I ran it for both the file, and for the folder, both with the same outcome.
```
Batch job confirmed by user.
Batch process started at Tue 23 Jul 18:08:00 BST 2019
Populating list of RRDs from the dircetory: /usr/local/nagios/share/perfdata/lb-001/LTM___Pool___1.rrd
Batch job finished at Tue 23 Jul 18:08:00 BST 2019.
A total of 0 file(s) were updated with a total of 0 datasource(s).
Changes logged to the file /tmp/fix_rrd_ds.log
```
```
Batch job confirmed by user.
Batch process started at Tue 23 Jul 18:08:00 BST 2019
Populating list of RRDs from the dircetory: /usr/local/nagios/share/perfdata/lb-001/LTM___Pool___1.rrd
Batch job finished at Tue 23 Jul 18:08:00 BST 2019.
A total of 0 file(s) were updated with a total of 0 datasource(s).
Changes logged to the file /tmp/fix_rrd_ds.log
```
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Failed Graphins for individual checks
I had thought this might not be the problembomahony wrote:This didnt seem to fix my issue. I ran it for both the file, and for the folder, both with the same outcome.
Where are you seeing a problem? You didn't define you actual problem, just made reference to another post.scottwilkerson wrote:However, if you are seeing the issue on and off, this is likely not the problem unless you are changing the amount of datasources the plugin returns
Re: Failed Graphins for individual checks
Hi scott. Due to time pressure i just deleted the old data. I will spend some time in the upcoming weeks and try to track down the issue.
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Failed Graphins for individual checks
Sounds good.bomahony wrote:Hi scott. Due to time pressure i just deleted the old data. I will spend some time in the upcoming weeks and try to track down the issue.
Re: Failed Graphins for individual checks
You can close this for now as i am unusre when i will have the proper time to investigate
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Failed Graphins for individual checks
Great!bomahony wrote:You can close this for now as i am unusre when i will have the proper time to investigate
Locking