ndo2db hangs
Posted: Thu Jun 14, 2018 4:02 pm
I'm seeing frequent log entries on my Nagios XI server from ndo2db:
This results in no checks happening till ndo2db comes back (sometimes it doesn't) or is restarted. When it comes back on it's own it overloads the server I get a slew of these errors:
This setup uses mod_gearman and three remote gearman workers without any restrictions as to hostgroups or servicegroups. The remote workers sit idle then ndo2db is dead.
What can be done to fix this?
Code: Select all
Jun 14 16:56:18 ip-10-35-32-9 ndo2db: Warning: Retrying message send. This can occur because you have too few messages allowed or too few total bytes allowed in message queues. You are currently using 128000 of 32768 messages and 131072000 of 131072000 bytes in the queue. See README for kernel tuning options.
Jun 14 16:56:38 ip-10-35-32-9 ndo2db: Error: max retries exceeded sending message to queue. Kernel queue parameters may need to be tuned. See README.
Jun 14 16:56:38 ip-10-35-32-9 ndo2db: Warning: queue send error, retrying...Code: Select all
Jun 14 16:52:01 ip-10-35-32-9 nagios: #011Max concurrent service checks (2000) has been reached. Nudging $CHECK_NAMEWhat can be done to fix this?