Page 2 of 3

Re: host checks running uncontrolled

Posted: Thu Sep 13, 2012 12:42 am
by Mitchell
I think it still has issues (not as severe as it was). I will dig further and will post updates.

Re: host checks running uncontrolled

Posted: Thu Sep 13, 2012 10:01 am
by scottwilkerson
Thanks

Re: host checks running uncontrolled

Posted: Mon Oct 01, 2012 10:47 pm
by Mitchell
I checked with Sven Nierlein about this issue and it does not seem to be related to mod_gearman. https://groups.google.com/d/topic/mod_g ... discussion

I am going to turn off the gearman today again and will leave it to run for few days to check if it is gearman et. al?
Uncontrolled checks.docx

Re: host checks running uncontrolled

Posted: Tue Oct 02, 2012 9:56 am
by mguthrie
Yeah Sven doesn't actually like us very much. I'll look through the Core tracker and see if anyone else has reported this...

Re: host checks running uncontrolled

Posted: Tue Oct 02, 2012 10:00 am
by mguthrie
Do you guys use any event handlers to manually reschedule checks?

What version of mod gearman are you currently running?

Re: host checks running uncontrolled

Posted: Thu Oct 04, 2012 12:33 am
by Mitchell
Apologize for the delay. With nagios 3.4.1 (XI 2011R3.2), gearmand-0.25-1.i386 and mod_gearman-1.3.8-1.el6.i386.

I have the event handlers enabled but do not have any configured. I don's even have host parents configured (which could theoretically schedule additional checks?)

Regards
Ashish

Re: host checks running uncontrolled

Posted: Mon Jan 28, 2013 10:46 am
by mrochelle
I was wondering if anyone found a solution for this problem? I'm experiencing the exact same symptoms.
System:
Nagios XI Version : 2012R1.3 with Enterprise Upgrade
nagprod01.cellnet.com 2.6.32-279.11.1.el6.x86_64 x86_64
mod_gearman-1.3.8-1.e.rhel6.x86_64.rpm
gearmand-server-0.25-1.rhel6.x86_64.rpm
gearmand-devel-0.25-1.rhel6.x86_64.rpm
gearmand-0.25-1.rhel6.x86_64.rpm
CentOS release 6.3 (Final)
Gnome is not installed
Apache Information
PHP Version: 5.3.3
Agent: Mozilla/5.0 (Windows NT 5.1; rv:20.0) Gecko/20130127 Firefox/20.0
Server Name:
Server Address:
Server Port: 80
Date/Time
PHP Timezone: America/Chicago
PHP Time: Mon, 28 Jan 2013 09:29:07 -0600
System Time: Mon, 28 Jan 2013 09:29:07 -0600
Nagios XI Data
nagios (pid 13572) is running...
NPCD running (pid 2089).
ndo2db (pid 2215) is running...
CPU Load 15: 0.38
Total Hosts: 1707
Total Services: 2085

The number of available workers increase from 60 after I restart nagios. The attached snap shots are after 24 hours. Host and Service workers available are up to 100.
Average check times are 12 mins with 1 retry in 3 mins.

Re: host checks running uncontrolled

Posted: Mon Jan 28, 2013 11:45 am
by scottwilkerson
I was having a look at your image and having a little trouble seeing what the problem is.
mrochelle wrote:Host and Service workers available are up to 100.
Do you mean available workers? this may be normal depending on your setting in each /etc/mod_gearman/mod_gearman_worker.conf

Re: host checks running uncontrolled

Posted: Mon Jan 28, 2013 11:59 am
by mrochelle
Yes, let me clarify, the problem is the Active Host Checks is slowly growing over time. It will continue to grow until I restart nagios.

Re: host checks running uncontrolled

Posted: Mon Jan 28, 2013 1:09 pm
by scottwilkerson
Actually, this calculation error was corrected in 2012R1.4, however there was a different bug that was added in 1.4 that affects the new CCM and 2012R1.5 should be released this week resolving that.

If you want to install 2012R1.4 you can do the upgrade and then install the attached CCM component through Admin -> Manage Components.