host checks running uncontrolled

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
User avatar
Mitchell
Posts: 130
Joined: Thu Jan 05, 2012 2:33 am

Re: host checks running uncontrolled

Post by Mitchell »

I think it still has issues (not as severe as it was). I will dig further and will post updates.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: host checks running uncontrolled

Post by scottwilkerson »

Thanks
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
User avatar
Mitchell
Posts: 130
Joined: Thu Jan 05, 2012 2:33 am

Re: host checks running uncontrolled

Post by Mitchell »

I checked with Sven Nierlein about this issue and it does not seem to be related to mod_gearman. https://groups.google.com/d/topic/mod_g ... discussion

I am going to turn off the gearman today again and will leave it to run for few days to check if it is gearman et. al?
Uncontrolled checks.docx
You do not have the required permissions to view the files attached to this post.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: host checks running uncontrolled

Post by mguthrie »

Yeah Sven doesn't actually like us very much. I'll look through the Core tracker and see if anyone else has reported this...
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: host checks running uncontrolled

Post by mguthrie »

Do you guys use any event handlers to manually reschedule checks?

What version of mod gearman are you currently running?
Last edited by mguthrie on Tue Oct 02, 2012 10:31 am, edited 1 time in total.
Reason: added additional questions
User avatar
Mitchell
Posts: 130
Joined: Thu Jan 05, 2012 2:33 am

Re: host checks running uncontrolled

Post by Mitchell »

Apologize for the delay. With nagios 3.4.1 (XI 2011R3.2), gearmand-0.25-1.i386 and mod_gearman-1.3.8-1.el6.i386.

I have the event handlers enabled but do not have any configured. I don's even have host parents configured (which could theoretically schedule additional checks?)

Regards
Ashish
User avatar
mrochelle
Posts: 238
Joined: Fri May 04, 2012 11:20 am
Location: Heart of America

Re: host checks running uncontrolled

Post by mrochelle »

I was wondering if anyone found a solution for this problem? I'm experiencing the exact same symptoms.
System:
Nagios XI Version : 2012R1.3 with Enterprise Upgrade
nagprod01.cellnet.com 2.6.32-279.11.1.el6.x86_64 x86_64
mod_gearman-1.3.8-1.e.rhel6.x86_64.rpm
gearmand-server-0.25-1.rhel6.x86_64.rpm
gearmand-devel-0.25-1.rhel6.x86_64.rpm
gearmand-0.25-1.rhel6.x86_64.rpm
CentOS release 6.3 (Final)
Gnome is not installed
Apache Information
PHP Version: 5.3.3
Agent: Mozilla/5.0 (Windows NT 5.1; rv:20.0) Gecko/20130127 Firefox/20.0
Server Name:
Server Address:
Server Port: 80
Date/Time
PHP Timezone: America/Chicago
PHP Time: Mon, 28 Jan 2013 09:29:07 -0600
System Time: Mon, 28 Jan 2013 09:29:07 -0600
Nagios XI Data
nagios (pid 13572) is running...
NPCD running (pid 2089).
ndo2db (pid 2215) is running...
CPU Load 15: 0.38
Total Hosts: 1707
Total Services: 2085

The number of available workers increase from 60 after I restart nagios. The attached snap shots are after 24 hours. Host and Service workers available are up to 100.
Average check times are 12 mins with 1 retry in 3 mins.
You do not have the required permissions to view the files attached to this post.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: host checks running uncontrolled

Post by scottwilkerson »

I was having a look at your image and having a little trouble seeing what the problem is.
mrochelle wrote:Host and Service workers available are up to 100.
Do you mean available workers? this may be normal depending on your setting in each /etc/mod_gearman/mod_gearman_worker.conf
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
User avatar
mrochelle
Posts: 238
Joined: Fri May 04, 2012 11:20 am
Location: Heart of America

Re: host checks running uncontrolled

Post by mrochelle »

Yes, let me clarify, the problem is the Active Host Checks is slowly growing over time. It will continue to grow until I restart nagios.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: host checks running uncontrolled

Post by scottwilkerson »

Actually, this calculation error was corrected in 2012R1.4, however there was a different bug that was added in 1.4 that affects the new CCM and 2012R1.5 should be released this week resolving that.

If you want to install 2012R1.4 you can do the upgrade and then install the attached CCM component through Admin -> Manage Components.
You do not have the required permissions to view the files attached to this post.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked