Problem: A host goes into an unreachable state or a service goes into an unknown state. The contacts for the service/host are not notified because the service/host notification options do not include these states (unknown/unreachable). When the host or service goes back into
an ok state, the contact is sent the recovery notification. This is incorrect. Recovery notifications should not be sent if the problem
notification was not sent.
Using this config:
Code: Select all
define contact {
contact_name jr2
alias J
host_notifications_enabled 1
service_notifications_enabled 1
host_notification_period 24x7
service_notification_period 24x7
host_notification_options d,r
service_notification_options c,r
email nobody@example.com
use generic-contact
}
define hostgroup {
hostgroup_name NotificationTest
alias Testing notification bug
}
define service {
service_description NotificationTest
use active-passive-service
active_checks_enabled 1
hostgroup_name NotificationTest
check_command check_dummy!3!notification test!
register 1
}
define host {
host_name notification-test
use active-passive-host
check_command check_dummy!2!notification test!
active_checks_enabled 1
parents notification-test-upstream
alias notification-test
address 192.168.10.10
hostgroups NotificationTest
contacts jr2
register 1
}
define host {
host_name notification-test-upstream
use active-passive-host
check_command check_dummy!2!notification test!
active_checks_enabled 1
alias notification-test-upstream
address 192.168.10.10
hostgroups NotificationTest
contacts jr2
register 1
}
recovery notification is sent. Similarly let the host notification-test go into a hard unreachable state. Then manually clear it.
A notification will be sent to jr2.
I can not replicate this on our older nagios instance using: NagiosXI 2014R2.0 Centos 6.5.
In that case submitting a clear/ok for the unreachable state of notification-test or the unknown state of NotificationTest does not send a recovery notification to jr2. This is the desired and documented result per: https://assets.nagios.com/downloads/nag ... tions.html
I have verified that the expanded objects as written to object.cache are the same on both the 2014R2.0 and 5.5.1 instances."The third host or service filter that must be passed is the host- or service-specific notification options. Each service definition contains options that determine whether or not notifications can be sent out for warning states, critical states, and recoveries. Similarly, each host definition contains options that determine whether or not notifications can be sent out when the host goes down, becomes unreachable, or recovers. If the host or service notification does not pass these options, no one gets notified. If it does pass these options, the notification gets passed to the next filter. Note: Notifications about host or service recoveries are only sent out if a notification was sent out for the original problem. It doesn't make sense to get a recovery notification for something you never knew was a problem."
Any idea what is happening here? This is a showstopper for us deploying the new nagios instances.
Thanks.
-- rouilj
John Rouillard