>
> 4) If a worker node fails in some way, only the then-in-flight checks
> would be lost resulting in a "(Service Check Timeout)", and when
> Nagios re-tried the check, it would then be executed on any one of the
> remaining cluster nodes.
>
> 5) Checks should not have affinity for any particular node (for the
> reason stated in #4). If local resources require that a particular
> node execute a particular check, this should be accomplished via NRPE.
>
Hi Adam,
When I found the DNX project at sf.net, I was already curious whether it
would be the one Bob proposed.
About the two points above (especially 5): is there any plan for some
"redundant" monitoring, i.e. having the same check executed on two
geographically separated nodes and using the "AND" of their results?
Or is that beyond the aim of the project?
Regards,
thomas
This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]