Seeking suggestions for Parent / Child monitoring
Posted: Thu Apr 27, 2017 1:13 pm
Been experimenting recently with the Parent / Child relationship with monitoring in Nagios XI.
For example, I have a Host Hypervisor set as a Parent to a bunch of VMs running under the Hypervisor which are the Children. Nagios XI is set to check the Hypervisor (parent) as well as all the VMs (children) at the same interval. Say, every 1 minute.
As an experiment, I unplugged the network cable for the Hypervisor. The children all reported as down as did the parent.
My understanding was that the parent/child relationship was supposed to prevent this. Or rather, I thought what was supposed to happen is that only the parent was supposed to alert and the children silently set themselves to a downed state so as not to create too much "noise" in the reporting.
We considered monitoring the Hypervisor every minute and the children every 2 minutes, but wasn't sure if this was the best way to minimize the problem (as it might still be possible for the network error to be observed by the check on the children before the check on the parent.)
What's the recommended way to configure this?
Thanks!
For example, I have a Host Hypervisor set as a Parent to a bunch of VMs running under the Hypervisor which are the Children. Nagios XI is set to check the Hypervisor (parent) as well as all the VMs (children) at the same interval. Say, every 1 minute.
As an experiment, I unplugged the network cable for the Hypervisor. The children all reported as down as did the parent.
My understanding was that the parent/child relationship was supposed to prevent this. Or rather, I thought what was supposed to happen is that only the parent was supposed to alert and the children silently set themselves to a downed state so as not to create too much "noise" in the reporting.
We considered monitoring the Hypervisor every minute and the children every 2 minutes, but wasn't sure if this was the best way to minimize the problem (as it might still be possible for the network error to be observed by the check on the children before the check on the parent.)
What's the recommended way to configure this?
Thanks!