State of hosts with passive checks
Posted: Thu Sep 18, 2014 2:39 pm
Hi all,
I recently implemented a passive check setup at a remote site. In this scenario, a host with a dynamic IP (reachable through a DynDNS domain) performs the checks on the other hosts of the local network and sends the results to the Nagios server via NSCA.
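For reference, a minimal sketch of how results can be handed to send_nsca, which reads tab-separated lines on standard input: three fields for a host result and four for a service result. The helper names host_result and service_result, the host name printer1, the server name nagios.example.org and the file paths are placeholders for illustration, not my actual configuration.

```shell
#!/bin/bash
# Sketch: format passive check results the way send_nsca expects them on stdin.
# Fields are tab-separated; a host result has 3 fields, a service result has 4.

# host_result <host> <return_code> <output>
host_result() {
    printf '%s\t%s\t%s\n' "$1" "$2" "$3"
}

# service_result <host> <service> <return_code> <output>
service_result() {
    printf '%s\t%s\t%s\t%s\n' "$1" "$2" "$3" "$4"
}

# Hypothetical usage on the remote host (server name and paths are assumptions):
# { host_result "printer1" 0 "PING OK"
#   service_result "printer1" "PING" 0 "PING OK"; } \
#   | /usr/local/nagios/bin/send_nsca -H nagios.example.org \
#       -c /usr/local/nagios/etc/send_nsca.cfg
```

Note that a service result only updates the service; the state of a passively monitored host is updated only when a host result (the three-field form) is submitted for it.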
Here is what I have observed: when this host is turned off, the Nagios server reports the following for the services (both those on the host itself and those on the hosts it monitors): "WARNING: Did not receive service status report for a long time!" This comes from using check_freshness, freshness_threshold and the no_report_warn command in the service definition; the command definition and its script are shown below.
That is what I expected to happen with the services when the host is down. However, I also notice that both the powered-off host and the hosts it monitors passively are all displayed in Nagios as "Up". For the powered-off host itself, the only explanation I can think of is that the IP address it previously had is now being used by another DynDNS client, so check_ping does not fail for it.
But I'm not sure what causes the passively monitored hosts to appear as "Up". I believe I tried using check_host_freshness and host_freshness_check_interval in the host definition, with no_report_warn as the check_command, but it made no difference.
Code:
define command{
    command_name    no_report_warn
    command_line    $USER1$/no_report_warn.sh
}
Code:
#!/bin/bash
# file: /usr/local/nagios/libexec/no_report_warn.sh
# Run by Nagios when no passive result has arrived within freshness_threshold.
echo "WARNING: Did not receive service status report for a long time!"
exit 1  # plugin exit status 1 = WARNING
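One thing that may matter when the same script is used as a host check_command: as far as I understand the Nagios plugin return-code conventions, exit status 1 means WARNING for a service, but for a host it is only treated as DOWN when aggressive host checking is enabled, whereas 2 is treated as DOWN or UNREACHABLE. This is worth verifying against the documentation for the Nagios version in use. Below is a sketch of a variant that takes the exit status as an argument; the function name no_report and its argument are hypothetical and not part of my current setup.

```shell
#!/bin/bash
# Hypothetical variant of no_report_warn.sh: the exit status is passed as $1.
# 1 = WARNING, 2 = CRITICAL (services); anything else is reported as UNKNOWN (3).
no_report() {
    local code="${1:-1}"
    local label
    case "$code" in
        1) label="WARNING" ;;
        2) label="CRITICAL" ;;
        *) label="UNKNOWN"; code=3 ;;
    esac
    echo "$label: Did not receive status report for a long time!"
    return "$code"
}

# When saved as a plugin script, the last line would be:
#   no_report "$@"; exit $?
```

A host definition could then call it with 2 and the service definitions with 1, using two separate command definitions.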
Best regards,
Daniel