reduced (single) alert for one location with many devices

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
pnewlon
Posts: 86
Joined: Mon May 16, 2011 2:19 pm

Post by pnewlon »

nscott wrote: That's a good point, I'll look into that. But are your problems resolved?
Kinda? Not really? It appears to be a timing problem: services on hosts behind a router are scheduled for a check before the router itself is checked. As soon as the router is in 'critical', the remaining hosts/services behind it are not checked. In the case of an outage this afternoon, I got ten (out of 24) alerts before the router outage stopped the flow.

nscott
Posts: 1040
Joined: Wed May 11, 2011 8:54 am

Post by nscott »

OK. That's the expected logic. It may seem broken at first, but running a check on the host every time a service returned critical, before sending a notification, would put quite a bit of stress on the target system. A way around this problem would be to increase the number of times the service check must return critical before a notification is sent.
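
To illustrate the workaround, here is a minimal Python sketch. It assumes a simplified model, not Nagios's actual scheduler: each service first fails at some minute after the outage begins, is rechecked at a fixed retry interval, and notifies only once it has failed a given number of times in a row, unless the router has already been marked down by then. The function name `leaked_alerts` and all numbers are hypothetical. In Nagios object definitions, the number of attempts is set with `max_check_attempts`.

```python
def leaked_alerts(first_fail_minutes, attempts, retry_minutes, router_down_minute):
    """Count services that would notify before the router is known to be down.

    Simplified model (an assumption, not Nagios internals): a service notifies
    when it has failed `attempts` consecutive checks, and notifications stop
    once the router itself has been marked down.
    """
    leaked = 0
    for first_fail in first_fail_minutes:
        # Minute at which the service has failed `attempts` times in a row.
        hard_minute = first_fail + (attempts - 1) * retry_minutes
        if hard_minute < router_down_minute:
            leaked += 1
    return leaked


# Hypothetical outage: 24 services whose first failed checks are spread
# evenly over the first 12 minutes; the router is marked down at minute 5.
first_fails = [i * 0.5 for i in range(24)]

print(leaked_alerts(first_fails, attempts=1, retry_minutes=1, router_down_minute=5))  # 10
print(leaked_alerts(first_fails, attempts=3, retry_minutes=5, router_down_minute=5))  # 0
```

With a single attempt, every service checked before the router is marked down sends an alert. Requiring more consecutive failures pushes each service's notification past that point in this model.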
Nicholas Scott
Former Nagios employee
pnewlon

Post by pnewlon »

I now have the 'far hosts' checked every five minutes and the routers every minute (there are only 34 of them). A router goes critical in five minutes and the 'far hosts' in ten. We'll see how that goes.
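
The timings above can be sanity-checked with a small Python sketch. The post gives only the check intervals and the times to critical, so the retry intervals and attempt counts below are assumptions chosen to match those numbers. The model is simplified and ignores check latency and execution time. The function name `minutes_to_hard_state` is hypothetical, and its parameters are named after the Nagios directives `check_interval`, `retry_interval`, and `max_check_attempts`.

```python
def minutes_to_hard_state(check_interval, retry_interval, max_check_attempts):
    """Return (best, worst) minutes from outage start to a hard problem state.

    Simplified model (an assumption): ignores check latency and execution time.
    Best case: the outage begins just before a scheduled check.
    Worst case: it begins just after one, so a full check_interval passes first.
    """
    # Time spent on retries after the first failed check.
    confirm = (max_check_attempts - 1) * retry_interval
    return confirm, check_interval + confirm


# Assumed values chosen to match the timings described in the post.
router = minutes_to_hard_state(check_interval=1, retry_interval=1, max_check_attempts=5)
far_host = minutes_to_hard_state(check_interval=5, retry_interval=5, max_check_attempts=2)

print(router)    # (4, 5)
print(far_host)  # (5, 10)

# The router's worst case should not exceed the far hosts' best case.
print(router[1] <= far_host[0])  # True
```

Under these assumptions the router is marked down no later than the earliest point at which a far host could go critical, but with no margin to spare.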
nscott

Post by nscott »

Ok, let me know.
pnewlon

Post by pnewlon »

nscott wrote: That's a good point, I'll look into that. But are your problems resolved?
Acceptably :-)