Can't ping monitored servers

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
Bertie
Posts: 1
Joined: Wed Nov 23, 2016 8:00 am

Can't ping monitored servers

Post by Bertie »

Hi all,

It's not exactly a problem with nagios itself but I'm wondering if anyone else has come across a similar issue while using Nagios. Basically, there is a Centos 7 virtual machine hosted on a ESXI host. This host has an internal IP address and monitors a bunch of servers. Nearly every day and can be multiple times a day - We start getting email alerts that services have gone down. When we jump onto the nagios box we have found that it is no longer able to ping servers that it monitors. This includes hostnames and IP addresses - The servers also have public IP addresses but are hosted within the same DC/network. The pings just time out but we are able to ping external sites such as google.com or bbc.co.uk.

When we issue a network restart - Everything starts working again until it happens again. We have checked the network configuration and all looks ok. We have tried a different 10.X.X.X IP address to see if that makes any difference, same happens. Hell we have even moved it to a different ESXI host to see if it makes a difference but it doesn't.

Anyone had anything similar or have any ideas on what could be causing it?
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: Can't ping monitored servers

Post by npolovenko »

Hello, @Bertie. Most often this is a networking issue. Next time when this issue happens run the traceroute command to see where the packets are getting lost.
traceroute xx.xxx.xx
Where xxx is the IP address of the failing host.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Locked