RE: [Nagios-devel] Service checks not being executed

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
Guest

RE: [Nagios-devel] Service checks not being executed

Post by Guest »

Good day,

Thanks muchly for your reply.

> This sounds like normal behavior, but I would agree that it is
> undesireable. How bad it gets depends on how many hosts go down and
> the host layout (parenting) you have in your particular situation.

Oh. That's unfortunate to hear.

> One thing that may help a bit is to make your host checks run faster.
> This can be done by setting the max_attempts to something high (like
> 10) and have the host check plugin (i.e. check_ping) send out only 1
> packet each time.

I'm using the default host plugin, which does only use the one packet.

However, please correct me if I am wrong, but wouldn't setting max_attempts
to a higher number make the situation _worse_? Since Nagios doesn't seem to
execute ANY service checks when it's doing host checking, wouldn't one want
Nagios to decide that the host is down as quickly as possible, so as to
resume its service checks?

> Also, you might want to make sure that the aggressive host checking
> option is disabled in the main config file.

It's already turned off, yeah. "agressive" is misspelled, BTW. =)

> I'll be looking into possible ways to speed host checks up in 2.0,
> but its not changing in 1.0.

I'm not sure if we're talking about the same thing here, so I'll just
confirm.

Host checking is going just fine for me- Nagios performs its host checks
exactly on schedule and determines that they're down exactly as and when it
should. The problem is that Nagios doesn't do any _service_ checks when it
does its _host_ checks. So, if I have 20 machines go down, and I have
Nagios wait for 3 minutes per host to decide that they're down, then that's
an entire hour that Nagios doesn't do any service checks for for any of the
other hosts, including those online. In my situation the hosts were
alphabetically near each other, so essentially NO service checking was done
for about 45 minutes.

Thanks again,

============================
Darren Gamble
Planner, Regional Services
Shaw Cablesystems GP
630 - 3rd Avenue SW
Calgary, Alberta, Canada
T2P 4L4
(403) 781-4948





This post was automatically imported from historical nagios-devel mailing list archives
Original poster: Darren.Gamble@sjrb.ca
Locked