rrdcached Problem
Posted: Wed Jun 11, 2014 10:18 am
Nagios graphing has a problem which just started. It looks like graphs are now building about an hour behind. The server has plenty of CPU (16 cores), 32 GB of RAM, resources do not look like an issue.
/var/log/messages
Jun 11 10:02:44 nag rrdcached[5586]: queue_thread_main: rrd_update_r (/app01/perfdata/extn/Check_Disk.rrd) failed with status -1. (/app01/perfdata/extn/Check_Disk.rrd: expected 45 data source readings (got 36) from 1402497763)
Jun 11 10:03:45 nag rrdcached[5586]: queue_thread_main: rrd_update_r (/app01/perfdata/cmin/Check_Disk.rrd) failed with status -1. (/app01/perfdata/cmin/Check_Disk.rrd: found extra data on update argument: 445185)
Early this morning I saw a huge spike in the ramdisk size which probably relates to this problem. That has been resolved and returned to normal.
CentOs 6.4 64_bit
rrdcached is used
rrdcached has been restarted with no errors
nagios restarted
npcd restarted (nothing significant in log)
/var/log/messages
Jun 11 10:02:44 nag rrdcached[5586]: queue_thread_main: rrd_update_r (/app01/perfdata/extn/Check_Disk.rrd) failed with status -1. (/app01/perfdata/extn/Check_Disk.rrd: expected 45 data source readings (got 36) from 1402497763)
Jun 11 10:03:45 nag rrdcached[5586]: queue_thread_main: rrd_update_r (/app01/perfdata/cmin/Check_Disk.rrd) failed with status -1. (/app01/perfdata/cmin/Check_Disk.rrd: found extra data on update argument: 445185)
Early this morning I saw a huge spike in the ramdisk size which probably relates to this problem. That has been resolved and returned to normal.
CentOs 6.4 64_bit
rrdcached is used
rrdcached has been restarted with no errors
nagios restarted
npcd restarted (nothing significant in log)