Page 1 of 1

Hostg down even if pingable

Posted: Tue Apr 29, 2014 11:22 am
by jericho_g
Hello -

We are running Nagios XI 2012R1.8 on Centos5.6. We have several hosts showing they're down even though ping checks are successful. I've even done immediate checks from the console, but the host's status stays DOWN.

-Jericho

Re: Hostg down even if pingable

Posted: Tue Apr 29, 2014 12:18 pm
by tmcdonald
What check command are you running from the command line? Can you post the full output? And what about the check configured in XI? Can you ping it manually? Output or screenshots for all of these would help.

Re: Hostg down even if pingable

Posted: Tue Apr 29, 2014 12:29 pm
by jericho_g
Output from CLI on Nagios host:

[root@clesitonag1 ~]# ping 10.106.9.1
PING 10.106.9.1 (10.106.9.1) 56(84) bytes of data.
64 bytes from 10.106.9.1: icmp_seq=1 ttl=252 time=50.5 ms
64 bytes from 10.106.9.1: icmp_seq=2 ttl=252 time=40.3 ms
64 bytes from 10.106.9.1: icmp_seq=3 ttl=252 time=40.4 ms
64 bytes from 10.106.9.1: icmp_seq=4 ttl=252 time=39.5 ms
64 bytes from 10.106.9.1: icmp_seq=5 ttl=252 time=80.6 ms
64 bytes from 10.106.9.1: icmp_seq=6 ttl=252 time=93.4 ms
64 bytes from 10.106.9.1: icmp_seq=7 ttl=252 time=37.7 ms
64 bytes from 10.106.9.1: icmp_seq=8 ttl=252 time=38.1 ms
64 bytes from 10.106.9.1: icmp_seq=9 ttl=252 time=74.4 ms
64 bytes from 10.106.9.1: icmp_seq=10 ttl=252 time=50.2 ms
64 bytes from 10.106.9.1: icmp_seq=11 ttl=252 time=96.2 ms
64 bytes from 10.106.9.1: icmp_seq=12 ttl=252 time=69.4 ms
64 bytes from 10.106.9.1: icmp_seq=13 ttl=252 time=95.7 ms
64 bytes from 10.106.9.1: icmp_seq=14 ttl=252 time=40.4 ms
64 bytes from 10.106.9.1: icmp_seq=15 ttl=252 time=95.9 ms
^C
--- 10.106.9.1 ping statistics ---
15 packets transmitted, 15 received, 0% packet loss, time 14664ms
rtt min/avg/max/mdev = 37.705/62.908/96.258/23.568 ms

Screenshots from Nagios web console:
Host status-10.106.9.1.jpg
ping-output-10.106.9.1.jpg

Re: Hostg down even if pingable

Posted: Tue Apr 29, 2014 12:33 pm
by tmcdonald
And can you post a screenshot or otherwise provide the configuration for the host in the CCM? I want to see the check and the arguments that are being run.

Re: Hostg down even if pingable

Posted: Tue Apr 29, 2014 12:37 pm
by jericho_g
Here is the config for this host from /usr/local/nagios/etc/hosts:

define host {
host_name iserallmpls1.advanstar.com
use xiwizard_switch_host
alias ISE MPLS router
address 10.106.9.1
parents clerallmpls1.cle.advanstar.com
max_check_attempts 5
check_interval 5
retry_interval 1
check_period xi_timeperiod_24x7
contacts ADV_Alert_ISE_MPLS_site_down
notification_interval 60
notification_period 24x7
notification_options d,u,r,s,
notifications_enabled 1
icon_image switch.png
statusmap_image switch.png
_xiwizard switch
register 1
}

Re: Hostg down even if pingable

Posted: Tue Apr 29, 2014 4:54 pm
by abrist
Did you have time/date issues or db problems recently? The last check on that host occurred yesterday.
If you schedule an immediate check from the details tab in the UI, does it get checked immediately?
Check to make sure that both dates are correct in Admin --> System Profile.

Re: Hostg down even if pingable

Posted: Wed Apr 30, 2014 3:29 pm
by jericho_g
Confirmed system time is correct. Scheduling immediate checks, completes successfully every time. Status still shows down and critical:
CRITICAL - 10.106.9.1: rta nan, lost 100%

Re: Hostg down even if pingable

Posted: Wed Apr 30, 2014 4:56 pm
by abrist
Perform the steps from the following document:
http://library.nagios.com/library/produ ... ystem-time
And then report back.
You will most likely need to restart nagios and then schedule immediate checks.