host checks running uncontrolled
Re: host checks running uncontrolled
I think it still has issues (not as severe as it was). I will dig further and will post updates.
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
Re: host checks running uncontrolled
I checked with Sven Nierlein about this issue and it does not seem to be related to mod_gearman. https://groups.google.com/d/topic/mod_g ... discussion
I am going to turn off Gearman again today and leave it off for a few days to check whether Gearman et al. is the cause.
Re: host checks running uncontrolled
Yeah Sven doesn't actually like us very much. I'll look through the Core tracker and see if anyone else has reported this...
Re: host checks running uncontrolled
Do you guys use any event handlers to manually reschedule checks?
What version of mod_gearman are you currently running?
Last edited by mguthrie on Tue Oct 02, 2012 10:31 am, edited 1 time in total.
Reason: added additional questions
Re: host checks running uncontrolled
Apologies for the delay. I am running Nagios 3.4.1 (XI 2011R3.2) with gearmand-0.25-1.i386 and mod_gearman-1.3.8-1.el6.i386.
I have event handlers enabled but do not have any configured. I don't even have host parents configured (which could theoretically schedule additional checks?).
Regards
Ashish
Re: host checks running uncontrolled
I was wondering if anyone found a solution for this problem? I'm experiencing the exact same symptoms.
System:
Nagios XI Version : 2012R1.3 with Enterprise Upgrade
nagprod01.cellnet.com 2.6.32-279.11.1.el6.x86_64 x86_64
mod_gearman-1.3.8-1.e.rhel6.x86_64.rpm
gearmand-server-0.25-1.rhel6.x86_64.rpm
gearmand-devel-0.25-1.rhel6.x86_64.rpm
gearmand-0.25-1.rhel6.x86_64.rpm
CentOS release 6.3 (Final)
Gnome is not installed
Apache Information
PHP Version: 5.3.3
Agent: Mozilla/5.0 (Windows NT 5.1; rv:20.0) Gecko/20130127 Firefox/20.0
Server Name:
Server Address:
Server Port: 80
Date/Time
PHP Timezone: America/Chicago
PHP Time: Mon, 28 Jan 2013 09:29:07 -0600
System Time: Mon, 28 Jan 2013 09:29:07 -0600
Nagios XI Data
nagios (pid 13572) is running...
NPCD running (pid 2089).
ndo2db (pid 2215) is running...
CPU Load 15: 0.38
Total Hosts: 1707
Total Services: 2085
The number of available workers increases from 60 after I restart Nagios. The attached snapshots were taken after 24 hours. Host and Service workers available are up to 100.
Average check times are 12 mins with 1 retry in 3 mins.
scottwilkerson
Re: host checks running uncontrolled
I was having a look at your image and having a little trouble seeing what the problem is.
mrochelle wrote: Host and Service workers available are up to 100.
Do you mean available workers? This may be normal, depending on your settings in each /etc/mod_gearman/mod_gearman_worker.conf.
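For anyone trying to tell idle workers apart from a real backlog, here is a rough sketch (not from this thread) of how the queue numbers could be read. It assumes gearmand's plain-text admin `status` reply, which lists one queue per line as name, total jobs, running jobs, and available workers separated by tabs, and ends with a line containing a single dot. It also assumes that the total includes the running jobs. The function name `parse_gearman_status` and the sample reply are made up for illustration.

```python
def parse_gearman_status(text):
    """Parse a gearmand admin `status` reply into a dict keyed by queue name.

    Assumed line format (tab separated): name, total, running, workers.
    The reply is assumed to end with a line holding a single ".".
    """
    queues = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line == ".":
            continue  # skip blank lines and the terminator
        name, total, running, workers = line.split("\t")
        queues[name] = {
            "total": int(total),
            "running": int(running),
            "workers": int(workers),
            # Jobs queued but not yet picked up by a worker
            # (assumes total includes running jobs).
            "waiting": int(total) - int(running),
        }
    return queues


# Hypothetical reply, not taken from the systems discussed in this thread.
sample = "host\t0\t0\t100\nservice\t5\t2\t100\n.\n"

for name, stats in parse_gearman_status(sample).items():
    print(name, stats)
```

If the available worker count is high while nothing is waiting, that would suggest the workers are simply idle, which points at the worker settings rather than at a fault.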
Re: host checks running uncontrolled
Yes, let me clarify: the problem is that the Active Host Checks count grows slowly over time. It continues to grow until I restart Nagios.
scottwilkerson
Re: host checks running uncontrolled
Actually, this calculation error was corrected in 2012R1.4. However, 1.4 introduced a different bug that affects the new CCM; 2012R1.5, which should be released this week, resolves it.
If you want to install 2012R1.4 you can do the upgrade and then install the attached CCM component through Admin -> Manage Components.