Page 1 of 1

Hard State

Posted: Tue Jan 21, 2014 1:50 pm
by rentsys
Recently this has been happening

Code: Select all

2014-01-19 07:38:13	ppmon drac		DOWN	HARD	1 of 7	CRITICAL - 10.148.148.18: rta nan, lost 100%
it skips soft state and goes straight to hard state. what would cause this? It doesn't happen every time.

Re: Hard State

Posted: Tue Jan 21, 2014 1:57 pm
by lmiltchev
Can you post the host/service/template definition? Hide the sensitive info.

Re: Hard State

Posted: Tue Jan 21, 2014 2:03 pm
by rentsys

Code: Select all

###############################################################################
#
# Host configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date:       2014-01-21 12:09:53
# Version:    Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################

define host {
        host_name                       ppmon drac
        use                             xiwizard_generic_host
        alias                           alias
        address                         xx.xxx.xxx.xx
        parents                         xxxxxxxxxxx
        max_check_attempts              7
        check_interval                  10
        retry_interval                  2
        check_period                    xi_timeperiod_24x
        contact_groups                  admins
        notification_interval           60
        notification_period             xi_timeperiod_24x
        _xiwizard                       autodiscovery
        register                        1
        }

###############################################################################
#
# Host configuration file
#
# END OF FILE
#
###############################################################################

Re: Hard State

Posted: Tue Jan 21, 2014 2:51 pm
by rentsys
I have a back up server and it monitors the same devices. It had the right check.

Code: Select all

2014-01-19 02:40:50	ppmon drac		DOWN	SOFT	1 of 7	CRITICAL - 10.148.148.18: rta nan, lost 100%

Re: Hard State

Posted: Tue Jan 21, 2014 4:14 pm
by slansing
Do you passively updating the host? That is the only thing I can think of immediately that would cause this.

Re: Hard State

Posted: Tue Jan 21, 2014 5:46 pm
by rentsys
I figured out what it was. I recently set up inbound checks from my slave server and had it exclude localhost only. This meant that it was sending passive checks to my main server and when they would come in it would be in a hard state. I set it from exclude to include and picked the hosts I wanted to monitor. It works great now. This thread can be closed.