Nagios sending out too many alerts

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
johndoe
Posts: 114
Joined: Fri Oct 28, 2011 10:14 am

Re: Nagios sending out too many alerts

Post by johndoe »

I don't see that icon. A bit lost as what you mean by "could be because you have submitted a external command to Nagios disabling and re-enabling notifications."

All I did was create a BPI group, add notifications to it, go to the passive template and disable notifications, should be foolproof really, most of the services and host checks are using "XI passive templates" respectively and have "Skip" on the service/host notifications and "Off" on the templates however they still send notifications... As can be seen from the screenshots the configs should be correct (The screenshots are all for the same service).

If by external command you mean at some point clicking on "Quick Actions > Disable notifications" on the service... It's possible that has been done in the past but shouldn't the template settings get priority if changed more recently than that potential click? It's not viable for me to go to each service and disable notifications for each...

What's the solution for this?
Nagios XI 2012R2.8c Running on Ubuntu 12.04 Using 99% passive checks for monitoring
Monitoring nearly 800 Passive services spread through roughly 40 machines
Running on an 8 core, KVM virtualized VM, with 15 GB of RAM and using RAMDisk
johndoe
Posts: 114
Joined: Fri Oct 28, 2011 10:14 am

Re: Nagios sending out too many alerts

Post by johndoe »

Scott,

I had a look at the MySQL tables for both the nagios database and nagiosql... While doing that i compared a service which was working (not sending notifications with skip/OFF as previously mentioned) with one that wasn't working (still sending notifications)...

What I noticed is that on the nagiosql database (tbl_service table) the rows looked identical for both services yet on the nagios database (nagios_services table) the rows were different on the "notifications_enabled field"...

Again, correct me if i'm wrong but I believe this isn't the correct behaviour from the nagios part (possibly due to an earlier click on the disable/enable notifications like you mentioned before)?

Shouldn't these be synched and the latest action take priority?
Nagios XI 2012R2.8c Running on Ubuntu 12.04 Using 99% passive checks for monitoring
Monitoring nearly 800 Passive services spread through roughly 40 machines
Running on an 8 core, KVM virtualized VM, with 15 GB of RAM and using RAMDisk
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios sending out too many alerts

Post by scottwilkerson »

johndoe wrote:If by external command you mean at some point clicking on "Quick Actions > Disable notifications" on the service... It's possible that has been done in the past but shouldn't the template settings get priority if changed more recently than that potential click?
yes, this is what I meant.

johndoe wrote:Again, correct me if i'm wrong but I believe this isn't the correct behaviour from the nagios part (possibly due to an earlier click on the disable/enable notifications like you mentioned before)?

Shouldn't these be synched and the latest action take priority?
Actually this is the expected behavior, and I'll try to explain why.

Commands such at "Quick Actions > Disable notifications" take the highest precedent.

Lets say your system is up and running and you have notifications off for a host but you decide you need to enable notifications for this host/service and submit the command through "Quick Actions >Enable notifications". this is held at the highest level.

Later, you make some changes to a template that disables notification (similar to your situation), it is applied, and all the hosts/services that have the template on it get the new value on the underlying template, however you SPECIFICALLY set this host/service to be enabled at the highest level.

I hope this makes sense. During configuration changes it may not seem the best option, but you certainly wouldn't want another Admin applying a configuration and it overwriting the fact that you disabled/enabled a specific host/services.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
johndoe
Posts: 114
Joined: Fri Oct 28, 2011 10:14 am

Re: Nagios sending out too many alerts

Post by johndoe »

So what is the best/quickest way for me to ensure those that were clicked on don't take that "external command" change and change accordingly to the template (until i click on them again :))
Nagios XI 2012R2.8c Running on Ubuntu 12.04 Using 99% passive checks for monitoring
Monitoring nearly 800 Passive services spread through roughly 40 machines
Running on an 8 core, KVM virtualized VM, with 15 GB of RAM and using RAMDisk
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios sending out too many alerts

Post by scottwilkerson »

If you nagios.cfg has

Code: Select all

log_external_commands=1
You could go to Home -> Event Log and search for "ENABLE_SVC_NOTIFICATIONS"

If you are not logging external commands (the default) there isn't an elegant way to see this. You could however stop nagios, remove /usr/local/nagios/var/retention.dat file and restart nagios.

Code: Select all

service nagios stop
rm -f /usr/local/nagios/var/retention.dat
service nagios start
WARNING: This reset all of the info nagios is retaining, including ALL commands submitted, this include problem acknowledgments as well as short term state history (used to determine if host/service is flapping and if it is time to send a notification). ALL hosts/service will show a pending state until they make their next check.

For some organizations this may be no big deal, for others though it could be a real headache if you had hundereds of notifications disabled manually or problems acknowledged.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked