reduced (single) alert for one location with many devices
Posted: Wed Jul 20, 2011 2:13 pm
I have 34 locations (remote, connected via broadband and VPN tunnels). At each location there are approximately 18 devices (+/- a few) with 2-3 services defined for each device. If the broadband connection goes down for a given remote location, I get 'critical' messages for ALL the services Nagios cannot get to. I turned off all host notifications which reduced resultant email deluge a bit, and turned re-notification down to 8 hours but it is still a pretty big load of email notices to read on a blackberry when 2-3 locations are down for a day or so.
Is there a way to consolidate the devices into a single alert? I thought about using check_cluster but I would get a cluster alert for any single device/service down at a location. When the broadband connection is up, it is nice to only get a specific alert about a single device/service. I have the ability to monitor the router connecting the remote location, though I am not now doing it. If I defined the router (host / service) is there a way to say 'if you can't get to this device, refrain from alerting about all the devices behind it'?
Thanks! Phil
Is there a way to consolidate the devices into a single alert? I thought about using check_cluster but I would get a cluster alert for any single device/service down at a location. When the broadband connection is up, it is nice to only get a specific alert about a single device/service. I have the ability to monitor the router connecting the remote location, though I am not now doing it. If I defined the router (host / service) is there a way to say 'if you can't get to this device, refrain from alerting about all the devices behind it'?
Thanks! Phil