Would like to delay when Nagios sets a service to Critical

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
caf_infra
Posts: 27
Joined: Wed Apr 20, 2016 10:41 am

Would like to delay when Nagios sets a service to Critical

Post by caf_infra »

Hi,

I was wondering if it’s possible to configure the following behaviour into Nagios.
We currently monitor a bunch of Windows services on some servers in our development environment. We want a fairly granular log of the service state for troubleshooting purposes, but we only want Nagios to mark it as a problem if it’s been down for a significant period of time so that the system doesn’t report a fault every time the service gets bounced due to testing.

i.e.
1. It checks and logs the service state every 5 minutes
2. Services only flip to critical after they’ve been down for a continuous hour rather than the first time it checks and they’re down.

Thanks,
Justine
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Would like to delay when Nagios sets a service to Critic

Post by lmiltchev »

You could use:

Code: Select all

check_interval			5
retry_interval			1
max_check_attempts		60
This way, your host/service will be checked every 5 min. If it goes to soft "non-ok" state, it will be checked every minute up to 60 times, before goes to hard "non-ok" state. There is one caveat though - it you have too many hosts/services that fail at the same time, you will have many checks happening every minute for 1 hour. This can dramatically increase the load on your system.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked