Hi all,
It's not exactly a problem with nagios itself but I'm wondering if anyone else has come across a similar issue while using Nagios. Basically, there is a Centos 7 virtual machine hosted on a ESXI host. This host has an internal IP address and monitors a bunch of servers. Nearly every day and can be multiple times a day - We start getting email alerts that services have gone down. When we jump onto the nagios box we have found that it is no longer able to ping servers that it monitors. This includes hostnames and IP addresses - The servers also have public IP addresses but are hosted within the same DC/network. The pings just time out but we are able to ping external sites such as google.com or bbc.co.uk.
When we issue a network restart - Everything starts working again until it happens again. We have checked the network configuration and all looks ok. We have tried a different 10.X.X.X IP address to see if that makes any difference, same happens. Hell we have even moved it to a different ESXI host to see if it makes a difference but it doesn't.
Anyone had anything similar or have any ideas on what could be causing it?
Can't ping monitored servers
-
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Can't ping monitored servers
Hello, @Bertie. Most often this is a networking issue. Next time when this issue happens run the traceroute command to see where the packets are getting lost.
Where xxx is the IP address of the failing host.traceroute xx.xxx.xx
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.