Page 1 of 1
Escalations no longer notifying on return to "OK" state.
Posted: Mon Sep 24, 2018 3:50 pm
by HawkFan
System Overview:
CentOS 6.9
64 bit
Virtualized system
Using SSL
Nagios XI 5.5.4 (Problem existed in 5.5.3 and possibly earlier.)
We use service and host escalations for all notifications, and we no longer receive notifications when the status changes back to "OK". Warning and critical notifications are still received. This is impacting multiple services we monitor.
This all previously worked fine for years. I believe one of the last few upgrades may have changed something to cause this problem.
Re: Escalations no longer notifying on return to "OK" state.
Posted: Mon Sep 24, 2018 4:07 pm
by lmiltchev
Can you show us a sample escalation? We will try to recreate the issue in-house. Thank you!
Re: Escalations no longer notifying on return to "OK" state.
Posted: Mon Sep 24, 2018 4:11 pm
by HawkFan
Here you go, thanks. The monitor is service related, but I'll post both host and service escalation config.
define hostescalation {
# config_name Email Hawk Fan
hostgroup_name Hawk
contacts HawkFan
first_notification 1
last_notification 0
notification_interval 5
escalation_period 24x7
escalation_options u,r,d,
}
define serviceescalation {
# config_name Email HawkFan
hostgroup_name Hawk
service_description *
contacts HawkFan
first_notification 1
last_notification 0
notification_interval 5
escalation_period 24x7
escalation_options w,u,c,r,
}
Re: Escalations no longer notifying on return to "OK" state.
Posted: Tue Sep 25, 2018 10:33 am
by HawkFan
To add additional info:
Host Escalation notifications work correctly between states. (Critical > OK)
Service Escalations notifications work correctly for warning and critical, but an OK status is never sent. The problem is isolated to "OK" service escalation notifications. All services appear to be impacted.
Thanks
Re: Escalations no longer notifying on return to "OK" state.
Posted: Tue Sep 25, 2018 12:50 pm
by lmiltchev
We haven't been able to recreate the issue in-house. I tried a similar (very simple) service escalation in Nagios XI 5.5.4.
Code: Select all
define serviceescalation {
# config_name test
hostgroup_name linux-servers
service_description *
contacts ludmil
first_notification 1
last_notification 0
notification_interval 5
escalation_period 24x7
escalation_options w,u,c,r,
}
Code: Select all
define hostgroup {
hostgroup_name linux-servers
alias Linux Servers
members localhost
}
After a couple of my services recovered from a Critical state, recovery (OK) notifications were sent.
example01.PNG
There must be some other issues on your system that caused the issue. I believe this needs to be moved to our ticketing system for faster resolution. Can you open a new ticket via our support center here?
https://support.nagios.com/tickets/