Re: Check_Host_Alive timed out
Posted: Fri Jul 04, 2014 4:18 am
by ITSLR
The funny thing is that I can monitor the virtual machines on the host, but not the host itself...
Re: Check_Host_Alive timed out
Posted: Fri Jul 04, 2014 10:28 am
by eloyd
Was the host ever monitored successfully before this problem started, or has it never worked?
Re: Check_Host_Alive timed out
Posted: Fri Jul 04, 2014 10:31 am
by ITSLR
It worked for some days, then suddenly during the day (without any restart or config change) the host went "offline" and never came back up.
Re: Check_Host_Alive timed out
Posted: Fri Jul 04, 2014 10:35 am
by eloyd
You've stumped me. I got nothing. I'm going to think about this as I burn some hamburgers and drink some micro brews.
Re: Check_Host_Alive timed out
Posted: Fri Jul 04, 2014 8:32 pm
by Box293
Have you tried rebooting the Nagios server? I always like a fresh boot when troubleshooting odd problems.
Re: Check_Host_Alive timed out
Posted: Mon Jul 07, 2014 3:52 am
by V3xX
Hi,
I found the difference between the Nagios check and your manual check.
# 'check-host-alive' command definition
define command{
command_name check-host-alive
command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5
}
--> you use the IP-Address here
[nagios@kwapp001 ~]$ /usr/lib64/nagios/plugins/check_ping -H vmware001 -w 3000.0,80% -c 5000.0,100% -p 5 -4
PING OK - Packet loss = 0%, RTA = 0.40 ms|rta=0.400000ms;3000.000000;5000.000000;0.000000 pl=0%;80;100;0
--> you use the hostname here
[nagios@kwapp001 ~]$ ping vmware001
PING vmware001 (192.168.34.176) 56(84) bytes of data.
64 bytes from vmware001 (192.168.34.176): icmp_seq=1 ttl=64 time=0.201 ms
64 bytes from vmware001 (192.168.34.176): icmp_seq=2 ttl=64 time=0.369 ms
64 bytes from vmware001 (192.168.34.176): icmp_seq=3 ttl=64 time=0.372 ms
--> hostname too
Maybe there is an error in your host config file?
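To reproduce exactly what Nagios runs, it can help to expand the macros in the command definition by hand. The following is only a rough sketch: `expand_macros` and `command_for_host` are hypothetical helper names, and the `$USER1$` value is an assumption taken from the plugin path used in the manual test above (the real value comes from your resource file).

```python
# Sketch: expand Nagios-style $MACRO$ placeholders in a command_line.
# Assumption: $USER1$ is /usr/lib64/nagios/plugins, as in the manual test.

COMMAND_LINE = "$USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5"


def expand_macros(command_line, macros):
    """Replace each $NAME$ placeholder with its value from the macros dict."""
    for name, value in macros.items():
        command_line = command_line.replace(f"${name}$", value)
    return command_line


def command_for_host(address, plugin_dir="/usr/lib64/nagios/plugins"):
    """Build the check-host-alive command for the given host address."""
    return expand_macros(
        COMMAND_LINE,
        {"USER1": plugin_dir, "HOSTADDRESS": address},
    )


if __name__ == "__main__":
    # Pass the value of the 'address' directive from the host definition,
    # not the hostname typed on the shell.
    print(command_for_host("ADDRESS_FROM_HOST_DEFINITION"))
```

Running the printed command as the nagios user tests the same target that Nagios itself checks, rather than whatever the hostname happens to resolve to.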
Re: Check_Host_Alive timed out
Posted: Mon Jul 07, 2014 3:58 am
by ITSLR
@Box293
Do you mean restarting the Nagios daemon or the complete server? I restarted the Nagios daemon, which didn't help. Unfortunately I can't restart the machine right now because it hosts a lot more services than just Nagios, so I will have to try that some other time.
@V3xX
The manual test works fine with both hostname and IP:
[nagios@kwapp001 ~]$ /usr/lib64/nagios/plugins/check_ping -H 192.168.34.176 -w 3000.0,80% -c 5000.0,100% -p 5
PING OK - Packet loss = 0%, RTA = 0.41 ms|rta=0.415000ms;3000.000000;5000.000000;0.000000 pl=0%;80;100;0
[nagios@kwapp001 ~]$ /usr/lib64/nagios/plugins/check_ping -H vmware001 -w 3000.0,80% -c 5000.0,100% -p 5
PING OK - Packet loss = 0%, RTA = 0.36 ms|rta=0.357000ms;3000.000000;5000.000000;0.000000 pl=0%;80;100;0
The config also contains the IP:
define host{
use generic-hosts
host_name vmware001
alias VMWARE001
address 192.198.34.176
}
Re: Check_Host_Alive timed out
Posted: Mon Jul 07, 2014 5:02 am
by V3xX
Hi,
If this was copied and pasted from your host config, I found your error: the address in the config is 192.198.34.176, but your ping output shows 192.168.34.176.
define host{
use generic-hosts
host_name vmware001
alias VMWARE001
address 192.198.34.176
}
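A mismatch like this is easy to overlook by eye, so a small script can do the comparison. This is a minimal sketch, not an official Nagios tool: `parse_hosts`, `find_mismatches`, and `fake_resolve` are hypothetical names, the parser only handles simple `define host{...}` blocks, and it assumes the `address` directive holds an IP address rather than a hostname.

```python
import re
import socket

# Matches 'define host{ ... }' blocks (not hostgroup or other object types).
HOST_BLOCK = re.compile(r"define\s+host\s*\{(.*?)\}", re.DOTALL)


def parse_hosts(config_text):
    """Return one dict of directives per 'define host{...}' block."""
    hosts = []
    for body in HOST_BLOCK.findall(config_text):
        directives = {}
        for line in body.splitlines():
            line = line.split(";", 1)[0].strip()  # drop inline ';' comments
            if not line or line.startswith("#"):
                continue
            parts = line.split(None, 1)
            if len(parts) == 2:
                directives[parts[0]] = parts[1].strip()
        hosts.append(directives)
    return hosts


def find_mismatches(config_text, resolve=socket.gethostbyname):
    """List (host_name, configured_address, resolved_address) where they differ."""
    mismatches = []
    for host in parse_hosts(config_text):
        name, address = host.get("host_name"), host.get("address")
        if not name or not address:
            continue
        try:
            resolved = resolve(name)
        except OSError:
            continue  # name does not resolve, so there is nothing to compare
        if resolved != address:
            mismatches.append((name, address, resolved))
    return mismatches


# Host definition as posted in this thread.
SAMPLE = """
define host{
use generic-hosts
host_name vmware001
alias VMWARE001
address 192.198.34.176
}
"""


def fake_resolve(name):
    """Stand-in resolver using the IP from the ping output in this thread."""
    return {"vmware001": "192.168.34.176"}[name]


if __name__ == "__main__":
    for name, configured, resolved in find_mismatches(SAMPLE, resolve=fake_resolve):
        print(f"{name}: config has {configured}, name resolves to {resolved}")
```

On a real system you would read your own host config file and leave `resolve` at its default so the names are looked up through the system resolver.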
Re: Check_Host_Alive timed out
Posted: Mon Jul 07, 2014 6:40 am
by eloyd
V3xX wrote: "if this was copy and paste from host config, i found your error:"
Nice job. Funny how you miss the simple stuff.

Re: Check_Host_Alive timed out
Posted: Mon Jul 07, 2014 9:31 am
by ITSLR
Really!?! Just a typo?!?!?
Thank you all for your help.