ndo2db Hogging ALL the CPU
Posted: Thu Aug 21, 2014 6:12 pm
ndo2db is hogging all the CPU
Here is a line from top:
8022 nagios 20 0 51056 2324 1040 R 100.0 0.0 8:08.14 ndo2db
This is now about 80% of the time and nothing seems to impact it whether Nagios is restarted or mysqld, etc.
The Monitoring Engine Process seems to slowly die as the Event Queue keeps building until all checks are trying to execute at once. If I restart the monitoring engine it levels for 30 seconds.
No indication in logs of any issues.
System:
Redhat 6.5 64_bit
XI 2014R1.4 (did it in 1.3 also)
6433 MB of RAM free
CPU idle 72%
Load 1.6, 1.63, 1.38
I have checked the database tables and they are OK.
I have run /usr/local/nagiosxi/scripts/repairmysql.sh nagios no issues.
I have updated ulimits:
nagios hard memlock 128
nagios soft memlock 128
nagios hard nproc 4096
nagios soft nproc 4096
root hard memlock 128
root soft memlock 128
root hard nproc 4096
root soft nproc 4096
The server was restarted.
/etc/sysctl.conf modified:
# Controls the default maxmimum size of a mesage queue
kernel.msgmnb = 131072000
# Controls the maximum size of a message, in bytes
kernel.msgmax = 131072000
# Controls the maximum shared segment size, in bytes
kernel.shmmax = 4294967295
# Controls the maximum number of shared memory segments, in pages
kernel.shmall = 268435456
Here is a line from top:
8022 nagios 20 0 51056 2324 1040 R 100.0 0.0 8:08.14 ndo2db
This is now about 80% of the time and nothing seems to impact it whether Nagios is restarted or mysqld, etc.
The Monitoring Engine Process seems to slowly die as the Event Queue keeps building until all checks are trying to execute at once. If I restart the monitoring engine it levels for 30 seconds.
No indication in logs of any issues.
System:
Redhat 6.5 64_bit
XI 2014R1.4 (did it in 1.3 also)
6433 MB of RAM free
CPU idle 72%
Load 1.6, 1.63, 1.38
I have checked the database tables and they are OK.
I have run /usr/local/nagiosxi/scripts/repairmysql.sh nagios no issues.
I have updated ulimits:
nagios hard memlock 128
nagios soft memlock 128
nagios hard nproc 4096
nagios soft nproc 4096
root hard memlock 128
root soft memlock 128
root hard nproc 4096
root soft nproc 4096
The server was restarted.
/etc/sysctl.conf modified:
# Controls the default maxmimum size of a mesage queue
kernel.msgmnb = 131072000
# Controls the maximum size of a message, in bytes
kernel.msgmax = 131072000
# Controls the maximum shared segment size, in bytes
kernel.shmmax = 4294967295
# Controls the maximum number of shared memory segments, in pages
kernel.shmall = 268435456