Re: [Nagios-devel] Service checks not being executed

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
Guest

Re: [Nagios-devel] Service checks not being executed

Post by Guest »

This sounds like normal behavior, but I would agree that it is
undesireable. How bad it gets depends on how many hosts go down and
the host layout (parenting) you have in your particular situation.
One thing that may help a bit is to make your host checks run faster.
This can be done by setting the max_attempts to something high (like
10) and have the host check plugin (i.e. check_ping) send out only 1
packet each time.

Also, you might want to make sure that the aggressive host checking
option is disabled in the main config file.

I'll be looking into possible ways to speed host checks up in 2.0,
but its not changing in 1.0.



On 18 Sep 2002 at 10:44, Darren Gamble wrote:

> Good day,
>
> I posted a message to the nagios-users list awhile back about a problem with
> Nagios not executing service checks- they are properly scheduled, just never
> executed. I believe I have found at least one cause of this, which
> certainly appears to be a bug, so I'm posting a message here.
>
> We recently had 7 of our 203 hosts temporarily become unreachable due to
> ACLs. When they went down, I noted that Nagios didn't check any services
> while it performed its host-check commands. As a result, almost none of the
> service checks were performed for 30-45 minutes. I presume that this is a
> bug / undesirable behavior.
>
> Furthermore, I noted that if Nagios was restarted (I am retaining state
> information, BTW), the same thing would occur- it would take 30-45 minutes
> for most service checks to run for the first time, even though Nagios
> reported that they were scheduled for execution much, much sooner than that.
> Nagios didn't log that it was trying its host checks then- I presume because
> the machines and their services were already marked as down.
>
> I loaded this configuration on a separate machine that did not have the ACL
> problem, and noted that service checks were performed normally.
>
> I hope that this is enough information to have this bug fixed. Please reply
> if there's anything else that I could supply.
>
> I'm running a late 1.0b4 CVS copy on RH 7.2 .
>
> Thanks in advance,
>
> ============================
> Darren Gamble
> Planner, Regional Services
> Shaw Cablesystems GP
> 630 - 3rd Avenue SW
> Calgary, Alberta, Canada
> T2P 4L4
> (403) 781-4948
>
>
>
> -------------------------------------------------------
> This SF.NET email is sponsored by: AMD - Your access to the experts
> on Hammer Technology! Open Source & Linux Developers, register now
> for the AMD Developer Symposium. Code: EX8664
> http://www.developwithamd.com/developerlab
> _______________________________________________
> Nagios-devel mailing list
> Nagios-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/lis ... gios-devel
>



Ethan Galstad,
Nagios Developer
---
Email: nagios@nagios.org
Website: http://www.nagios.org






This post was automatically imported from historical nagios-devel mailing list archives
Original poster: nagios@nagios.org
Locked