parent host is not supressing recovery messages from child h

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
User avatar
Florin Andrei
Posts: 11
Joined: Mon Aug 26, 2013 6:04 pm
Location: California
Contact:

parent host is not supressing recovery messages from child h

Post by Florin Andrei »

Using the Network Reachability document, I've assigned the main routers on various networks as parent hosts to all hosts in their respective networks. When those networks go offline, I'm only notified that the parent router is offline. This is great.

However, when the network recovers, the parent router is quickly marked online, whereas many services on that network are still unreachable. Then comes the time for them all to be checked again, and they recover - and I get a whole bunch of recovery messages.

Is there a way to avoid this? Somehow silence the recovery messages if the cause was the parent router going offline?

It seems like using a parent host only protects me from the deluge of "service unreachable" messages, but not from the other deluge of recovery messages.

This is on Nagios 3.2.x
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: parent host is not supressing recovery messages from chi

Post by slansing »

You could do this a couple ways, One would be setting up notification_options, in your service .cfg files like so:

Code: Select all

notification_options		w,c,u
In this example, the service will only send out an alert if it reaches a warning, critical, or unknown state. Nothing else, if you wanted recovery alerts for example, you would just add "r" for recovery in the same fashion as above.

Another way would be to create a separate contact for these devices who's settings do not allow for recovery notifications, and that would be done in the same fashion as above as well.
User avatar
Florin Andrei
Posts: 11
Joined: Mon Aug 26, 2013 6:04 pm
Location: California
Contact:

Re: parent host is not supressing recovery messages from chi

Post by Florin Andrei »

slansing wrote:You could do this a couple ways, One would be setting up notification_options, in your service .cfg files like so:

Code: Select all

notification_options		w,c,u
In this example, the service will only send out an alert if it reaches a warning, critical, or unknown state. Nothing else, if you wanted recovery alerts for example, you would just add "r" for recovery in the same fashion as above.
This would also suppress recovery notifications when a service goes down on its own (just this one service, not the whole network and the router in front of it) and then comes back, is it not?
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: parent host is not supressing recovery messages from chi

Post by abrist »

Are you receiving DOWN notifications for the hosts inquestion before the router is marked as UNREACHABLE?
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
Locked