Page 2 of 2

Re: Connection Timeout/Refused State

Posted: Mon Aug 06, 2012 2:16 pm
by admin
Hi Fred -

Are you still getting alerts for services on a host when the host is DOWN? Nagios should be suppressing service notifications for hosts that are DOWN or UNREACHABLE.

Regarding checks, Nagios will still check services on a host that is DOWN or UNREACHABLE. The reason for this is that some services might be related to attributes/metrics related to the host, but that are not "on" the host. DNS name lookups are one example of services that might be associated with a host, but would still be in an OK state if the host went DOWN.

Currently there's not an easy way to prevent services from being checked when a host is DOWN. Some people have utilized service dependencies (referencing the Ping service as the service to use in the dependency) or external commands and event handlers to temporarily disable service checks when a host goes down, but they're not foolproof or easy to setup.

Re: Connection Timeout/Refused State

Posted: Mon Aug 20, 2012 1:13 am
by Fred Kroeger
Hi
I don't have a problem with the services being checked of a host that's down, what I don't think is right is that they are shown as CRITICAL in Nagios.
When we have a host go DOWN, then we have many Critical events also showing which coud hide/mask a real CRITICAL event.
My way of thinking is that as the service has a dependancy of the host being up, then it should have an UNKNOWN status when the host goes down, because the actual CRITICAL threshold hasn't been reached.


regards... Fred

Re: Connection Timeout/Refused State

Posted: Mon Aug 20, 2012 9:12 am
by scottwilkerson
Actually, if you have service dependencies setup, Nagios won't check the dependent services if the services they depend on is in a critical state.

Re: Connection Timeout/Refused State

Posted: Mon Aug 20, 2012 9:14 am
by mguthrie
To my knowledge the services in this circumstance are considered to be "handled" problems instead of open or unhandled. The services reading as critical when the host is down is expected behavior. Some users may only be authorized at the service level, so the current state still needs to be accurately displayed.

Re: Connection Timeout/Refused State

Posted: Mon Aug 20, 2012 11:42 pm
by Fred Kroeger
The point I'm trying to make (unsuccessfuly it appears) is that the status of the service should be UNKNOWN . CRITICAL indicates that the service has crossed a threshold. Technically if the host is DOWN, then the results of a service check are UNKNOWN ?

Re: Connection Timeout/Refused State

Posted: Tue Aug 21, 2012 2:44 pm
by CGraham
Some people have utilized service dependencies (referencing the Ping service as the service to use in the dependency)
This is the strategy we use. It's a pain to setup but worth it. Even though you don't get service notifications after the host is down, our service checks fail faster than our host checks, so we get slammed when a server goes down.