[Nagios-devel] service dependencies & recovery

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
Guest

[Nagios-devel] service dependencies & recovery

Post by Guest »

Using a lot of checks through NRPE for all hosts, we implemented a
general check_nrpe, one that just checks wether NRPE is listening on the
host, and let all the other checks using NRPE depend on this one.

Service dependency example:


define servicedependency{
host_name servername
service_description Check NRPE
dependent_service_description Load
execution_failure_criteria o
notification_failure_criteria w,u,c
}


The service dependency does work as, when the NRPE daemon goes down, we
only get a message saying that 'Check NRPE' is down. The other checks go
critical but no notification is sent out.

Problem is that when the NRPE on the host-to-be-checked is functional
again, we notice that some of the checks where a service dependency has
been applied do not recover, they remain in the 'critical' state.
The check returns to an OK-state only after we manually scheduled an
active check for it.

Nagios version we are using is 3.03
Same issues with previous versions noticed (3.01)


Anyone encountered similar issues?
Posted already on user-list, will try to do some more investigation
myself.



--
Tom De Cooman






This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]
Locked