Hard State
Posted: Tue Jan 21, 2014 1:50 pm
by rentsys
Recently this has been happening:
Code:
2014-01-19 07:38:13 ppmon drac DOWN HARD 1 of 7 CRITICAL - 10.148.148.18: rta nan, lost 100%
It skips the soft state and goes straight to a hard state. What would cause this? It doesn't happen every time.
Re: Hard State
Posted: Tue Jan 21, 2014 1:57 pm
by lmiltchev
Can you post the host/service/template definition? Hide the sensitive info.
Re: Hard State
Posted: Tue Jan 21, 2014 2:03 pm
by rentsys
Code:
###############################################################################
#
# Host configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2014-01-21 12:09:53
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################
define host {
host_name ppmon drac
use xiwizard_generic_host
alias alias
address xx.xxx.xxx.xx
parents xxxxxxxxxxx
max_check_attempts 7
check_interval 10
retry_interval 2
check_period xi_timeperiod_24x
contact_groups admins
notification_interval 60
notification_period xi_timeperiod_24x
_xiwizard autodiscovery
register 1
}
###############################################################################
#
# Host configuration file
#
# END OF FILE
#
###############################################################################
Re: Hard State
Posted: Tue Jan 21, 2014 2:51 pm
by rentsys
I have a backup server that monitors the same devices. It logged the check correctly, as a soft state:
Code:
2014-01-19 02:40:50 ppmon drac DOWN SOFT 1 of 7 CRITICAL - 10.148.148.18: rta nan, lost 100%
Re: Hard State
Posted: Tue Jan 21, 2014 4:14 pm
by slansing
Are you passively updating the host? That is the only thing I can think of offhand that would cause this.
Re: Hard State
Posted: Tue Jan 21, 2014 5:46 pm
by rentsys
I figured out what it was. I recently set up inbound checks from my slave server and had it exclude only localhost. That meant it was sending passive checks to my main server, and when those results came in, the host went straight to a hard state. I changed the setting from exclude to include and picked the hosts I wanted to monitor. It works great now. This thread can be closed.
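For anyone who finds this later and actually wants to keep the passive results: Nagios Core treats passive host check results as HARD by default, which would explain a "HARD 1 of 7" entry like the one in the log above. A minimal sketch of the relevant setting follows, assuming you can edit the main config file on the receiving server (the file location varies by install, and this was not tested on the setup in this thread):

```
# nagios.cfg (main configuration file on the server receiving passive results)
# 0 = passive host check results are HARD (default)
# 1 = passive host check results are SOFT, so they follow the normal
#     retry logic instead of jumping straight to HARD
passive_host_checks_are_soft=1
```

Nagios has to be restarted for the change to take effect. If the passive results are not wanted at all, limiting what the sending server forwards, as described above, is the simpler fix.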