We have our hosts set up with a slightly modified check-host-alive command (it warns at 300 ms round-trip time or 20% packet loss and goes critical at 500 ms or 50% packet loss), with check_interval = 5, retry_interval = 1, max_check_attempts = 2.
Code:
$USER1$/check_icmp -H $HOSTADDRESS$ -w 300.0,20% -c 500.0,50% -p 5

I am asking this because we see value in capturing packet loss at a more granular level. Ideally, since these are data center hosts, we'd like to alert on ANY packet loss, but we don't want to ping the hosts hundreds of times on each check just to catch small levels of packet loss.
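To put rough numbers on that trade-off, here is a small Python sketch. It is only back-of-the-envelope arithmetic, not anything check_icmp does internally. The function names are made up for illustration, and the second function assumes each packet is lost independently at a fixed rate, which real networks with bursty loss will not strictly follow.

```python
def loss_granularity(packets: int) -> float:
    """Smallest non-zero packet loss (in percent) that one check can report
    when it sends `packets` probes: losing a single probe."""
    if packets < 1:
        raise ValueError("packets must be at least 1")
    return 100.0 / packets


def detection_probability(loss_rate: float, packets: int) -> float:
    """Chance that at least one of `packets` probes is dropped in a single check.

    ASSUMPTION: every probe is lost independently with probability `loss_rate`
    (0.0 to 1.0). Bursty loss will behave differently.
    """
    if not 0.0 <= loss_rate <= 1.0:
        raise ValueError("loss_rate must be between 0.0 and 1.0")
    return 1.0 - (1.0 - loss_rate) ** packets


if __name__ == "__main__":
    # Compare the current setting (-p 5) with larger probe counts,
    # for a hypothetical underlying loss rate of 1%.
    for n in (5, 20, 100):
        print(
            f"-p {n}: granularity {loss_granularity(n):.1f}%, "
            f"chance one check sees 1% loss: {detection_probability(0.01, n):.1%}"
        )
```

With -p 5, a single lost packet already shows up as 20% loss, which sits right at the warning threshold in the command above, so the packet count rather than the threshold sets the finest loss level a single check can see.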
Thanks