we have a strange problem in our environment.
We are using the standard wizard for adding new network switches. This wizard creates a MRTG config file, and a corresponding
service with check_rrdtraf, which checks for the MRTG rrd file.
We have the problem that the bandwidth graph from time to time show up a glitch, which doesnt make any sense:
Here one example:
I traced the problem down to the following:
This is the content of the MRTG rrd file:
1371103500: 6.2738031371e+03 8.0207115058e+03
1371103800: 4.2759980537e+03 7.2949858804e+03
1371104100: 4.2759980537e+03 7.2949858804e+03
1371104400: 4.7058647467e+03 7.7942457101e+03
1371104700: 5.9542864011e+03 1.0424203964e+04
An this is the corresponding content of the check_rrdtraf, rrd file:
1371103860: 4.9902362900e+04 6.3743967412e+04
1371103920: 4.9902362900e+04 6.3743967412e+04
1371103980: 1.0038085019e+03 1.2833138409e+03
1371104040: 1.0038085019e+03 1.2833138409e+03
1371104100: 1.0038085019e+03 1.2833138409e+03
1371104160: 1.0038085019e+03 1.2833138409e+03
1371104220: 1.0038085019e+03 1.2833138409e+03
1371104280: 3.3523824741e+04 5.7192689302e+04
1371104340: 3.3523824741e+04 5.7192689302e+04
1371104400: 3.3523824741e+04 5.7192689302e+04
The glitch happend because of the values form: 1371103980 to 1371104220
As you can see the values form 1371103800 to 1371104100 in the MRTG rrd file didnt drop, but the ones in the check_rrdtraf, rrd file do!
I investigated further and found maybe a hint:
In the check_rrdtraf script, a command is excuted for the check of the MRTG rrd file:
It's this line here:
Code: Select all
DATASET=`rrdtool fetch $FILE AVERAGE -s-10minutes| grep -vi "nan"`
Here is an example:
Code: Select all
rrdtool fetch /var/lib/mrtg/example.rrd AVERAGE -s-10minutes | grep -vi "nan"
ds0 ds1
1371112500: 4.4748936875e+03 9.0650277233e+03
1371112800: 4.0733062489e+03 8.3976221581e+03
==========================================================================
rrdtool fetch /var/lib/mrtg/example.rrd AVERAGE -s-10minutes | grep -vi "nan"
ds0 ds1
1371112800: 4.0733062489e+03 8.3976221581e+03
Ever seen this kind of problem?
Kind regards
Dominik