Only some contacts being alerted
Posted: Mon Jan 30, 2017 6:31 pm
Hello. I'm trying to track down an alerting issue. I have 4 contacts assigned to a group that should be alerted on host and service problems. But only 3 of the contacts are being alerted. I have checked the contacts.cfg file on the backend to confirm the contacts are configured identically. The Nagios GUI shows alerts were sent to all 4 contacts. Nagios logs show the alert was sent. But... the system's maillog only shows messages going out for 3 of the contacts.
How can i figure out exactly what is going on?
The contact that is not working is named "oncall". It is an e-mail address that goes to a pager. I can see alerts in /usr/local/nagios/var/nagios.log, for example:
I tried turning on debug logging, and can confirm that Nagios thinks it is sending alerts to all 4 contacts:
But when i search the system's /var/log/maillog, i only see messages for the 3 contacts that work, not for the oncall one. I have confirmed that i can send messages at the command line (using "mail") to the oncall contact's e-mail address.
Support info: Server is Red Hat Enterprise Linux 7 (x86_64). Nagios XI was installed manually.
How can i figure out exactly what is going on?
The contact that is not working is named "oncall". It is an e-mail address that goes to a pager. I can see alerts in /usr/local/nagios/var/nagios.log, for example:
Code: Select all
[1485468150] SERVICE NOTIFICATION: oncall;mobile;Disk Space via NRPE;CRITICAL;xi_service_notification_handler;DISK CRITICAL - free space: / 160 MB (1% inode=81%):
[1485469641] SERVICE NOTIFICATION: oncall;mobile;Disk Space via NRPE;FLAPPINGSTART (OK);xi_service_notification_handler;DISK OK
[1485471229] SERVICE NOTIFICATION: oncall;mobile;Disk Space via NRPE;FLAPPINGDISABLED (CRITICAL);xi_service_notification_handler;DISK CRITICAL - free space: /tmp 47 MB (2% inode=99%):
Code: Select all
[1485471229.857779] [032.0] [pid=24052] ** Service Notification Attempt ** Host: 'mobile', Service: 'Disk Space via NRPE', Type: FLAPPINGDISABLED, Options: 0, Current State: 2, Last Notification: Wed Dec 31 18:00:00 1969
[1485471229.857802] [032.0] [pid=24052] Notification viability test passed.
[1485471229.857806] [032.1] [pid=24052] Current notification number: 0 (unchanged)
[1485471229.857819] [032.1] [pid=24052] Service notification will NOT be escalated.
[1485471229.857822] [032.1] [pid=24052] Adding normal contacts for service to notification list.
[1485471229.858182] [032.0] [pid=24052] 4 contacts were notified.
Support info: Server is Red Hat Enterprise Linux 7 (x86_64). Nagios XI was installed manually.