NSCA: big delay delay processing servicechecks

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
User avatar
sebastiaopburnay
Posts: 105
Joined: Sun Oct 31, 2010 1:40 pm
Location: Lisbon, Portugal

Re: NSCA: big delay delay processing servicechecks

Post by sebastiaopburnay »

slansing wrote:Hmm, okay so this is happening 24/7 correct? This error with processing the check results, and also the large latency between the received result, and it actually being displayed in the cgi's.

Are you noticing a large influx of files in:

Code: Select all

/var/spool/checkresults
Yes, it happens all the time
That folder (in my case /usr/local/nagios/var/spool/checkresults has 35 files with strange names, but all (except two) have a size of zero.
slansing wrote: Or piling up in:

Code: Select all

/tmp
The only two folders on my filesystem named tmp are /tmp and /var/tmp and they are both empty...

Anyway, I've been doing some further analysis to log files and I've found out entries on syslog like:

Code: Select all

Feb 20 20:39:23 SRVCARTMSCORE01 ndo2db: Message sent to queue.
Feb 20 20:39:23 SRVCARTMSCORE01 ndo2db: Warning: queue send error, retrying...
That lead me to posts in other forums such as:
1 - http://www.monitoring-portal.org/wbb/in ... adID=25673,
2 - http://support.nagios.com/forum/viewtop ... 43&p=84026 and
3 - http://support.nagios.com/wiki/index.ph ... 3.x_Issues

Rhey suggest that ndomod might unable Nagios' to do its work.

while 1st post just suggests downgrading NDOUtils,
Posts 1 and 2 suggest it has something to do with settings on /etc/sysctl.conf, but I'm not comfortable enough to do those changes without understanding them
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: NSCA: big delay delay processing servicechecks

Post by lmiltchev »

Run the following commands and show us the output:

Code: Select all

grep "for i in" /etc/init.d/nagios
cat /etc/sysctl.conf
Be sure to check out our Knowledgebase for helpful articles and solutions!
User avatar
sebastiaopburnay
Posts: 105
Joined: Sun Oct 31, 2010 1:40 pm
Location: Lisbon, Portugal

Re: NSCA: big delay delay processing servicechecks

Post by sebastiaopburnay »

lmiltchev wrote:Run the following commands and show us the output:

Code: Select all

grep "for i in" /etc/init.d/nagios

Code: Select all

 for i in 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30; do
lmiltchev wrote: cat /etc/sysctl.conf[/code]
/etc/sysctl.conf is all commented entries

Code: Select all

root@MyServer:~#  grep ^[^#] /etc/sysctl.conf 
 
 
 
root@MyServer:~# 
With further investigation of syslog and consultation of NDOUtils README file, I've defined the following values:

Code: Select all

root@myServer:~# sysctl -p
# msgmnb - product of (msgmax  * msgmni )
kernel.msgmnb = 393740288
# msgmax - suggested by NDOUtils README file
kernel.msgmax = 65536
# msgmni - left as it was
kernel.msgmni = 6008
So far I've detected no ndomod error/warning messages on syslog. And the servicechecks seem to be synched :).
Yet, I want to observe teh server's behaviour for a few days prior to closing this post.
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: NSCA: big delay delay processing servicechecks

Post by sreinhardt »

Cool, let us know if a few days how it continues to go!
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
User avatar
sebastiaopburnay
Posts: 105
Joined: Sun Oct 31, 2010 1:40 pm
Location: Lisbon, Portugal

[SOLVED] NSCA: big delay delay processing servicechecks

Post by sebastiaopburnay »

The kernel queue values have definitely solved those errors I was facing.

I do not recall how to set as solved, but I've marked it with a [v] for success :D

Thank you all for the time and knowledge shared.

This post will serve as knowlege article for my future implmenmtations
Locked