Page 1 of 1

e-mail alert issue

Posted: Thu Nov 08, 2012 10:11 am
by GaWd
Hello Everyone,

I have a small-to-mid-sized Nagios installation that has been operable for quite a few years now. We have ~100 hosts and ~500 checks. The software is fully updated.

Since the last update, I have twice experienced a situation where an alert has been sent only to the first recipient in the contact list. The other two contacts never have any e-mail sent from Nagios to the mail program on the Nagios server. Restarting Nagios resolves the issue. As I go through the maillog, I see that the #1 recipient was sent alert e-mails, but the next 2 recips weren't. It's rather odd.

Has anyone else seen or reported this behavior?

Re: e-mail alert issue

Posted: Thu Nov 08, 2012 3:21 pm
by agriffin
I haven't seen this issue myself, no.

If you want help with it, we'd need more information. Were there not any config changes since the upgrade? How is Nagios installed/upgraded on your system (from source, via packages, etc.)?

Re: e-mail alert issue

Posted: Fri Nov 09, 2012 5:40 pm
by GaWd
Hi agriffin,

This is a standard source install on a CentOS 5 computer. There were no systemwide changes made before, after, or during this period. Our alerts are set up using contacts that are grouped using contactgroups (pretty standard). The first time, our SMS addresses were not being sent e-mail, but our corporate e-mail was. This time, SMS e-mail was working (one contact group), but corporate e-mail was only going to 1 of 3 technicians in the corporate e-mail contactgroup.

So in this most recent example (below), only the first user received any e-mails:


define contactgroup {
contactgroup_name Email - Group IT Support
alias IT Support Group
members Name 1,Name 2,Name 3
}

This appears to have first happened in the beginning of October. The first time in early October, the corporate e-mail (above contactgroup) worked just fine, but the SMS message list either sent to the first but not the rest of the recips did not receive mail to that contact group.

Re: e-mail alert issue

Posted: Tue Nov 13, 2012 11:24 am
by GaWd
Any ideas?

Thanks!

Re: e-mail alert issue

Posted: Tue Nov 13, 2012 2:45 pm
by agriffin
It could have been caused by notification escalations being misconfigured. Then everybody would normally receive notifications except when the escalation is triggered. Or maybe you haven't enabled notifications for some uncommon check results like UNKNOWN, or UNREACHABLE?

If it was truly caused by the upgrade and not a configuration mistake, then it's a bug that will be very hard to track down and fix, since you can't seem to reproduce it or identify what caused it.