Escalation not working correctly

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Escalation not working correctly

Post by bosecorp »

I have a set of host that I haven seen an escalation of quite some time now.

I have read all documentation and I think I have setup everything correctly.

One thing to note is that if I do a BROADCAS (custom) notification from a particular host, the people that should get the notification get the notification. including non-escalated and escalated. This almost suggest that host escalation doest have the right criteria

here is my config. here is my config. at this point I have made some many changes that I wont be surprise if you find errors. I have tried every possible combination


================host escalation
define hostescalation {
# config_name CIS-IO-OPS-FS_iPad-Prod-Tier1_email
hostgroup_name EndPoint Devices - BL,EndPoint Devices - FHM - CC,EndPoint Devices - FHM - PP,EndPoint Devices - FHM - RES,EndPoint Devices - FHM - RND,EndPo
int Devices - STW
contacts dcopstcs_CIS_IO_OPS_OCC_email_page
contact_groups CIS-IO-OPS-OCC-Tier1_email,CIS-IO-SDM-FS-Tier1_email
first_notification 3
last_notification 4
notification_interval 2
escalation_period xi_timeperiod_24x7
escalation_options u,r,d,

================host template
define host {
name ht_cis_io_ops_ma_iPad_prod
check_command check_xi_host_ping!3000.0!80%!5000.0!100%!!!!
initial_state o
max_check_attempts 2
check_interval 2
retry_interval 1
active_checks_enabled 1
check_period xi_timeperiod_24x7
flap_detection_enabled 0
process_perf_data 1
retain_status_information 1
retain_nonstatus_information 1
contacts nagiosadmin
notification_interval 2
notification_period xi_timeperiod_24x7
notification_options d,u,r,f,
notifications_enabled 1
icon_image apple.png
statusmap_image apple.png
_xiwizard switch
register 0

================contact template
define contact {
name ct_cis_io_ops_fs_prod_email
alias ct_cis_io_ops_fs_prod_email
host_notifications_enabled 1
service_notifications_enabled 1
host_notification_period xi_timeperiod_24x7
service_notification_period xi_timeperiod_24x7
host_notification_options d,r,
service_notification_options w,c,r,
host_notification_commands notify-host-by-email
service_notification_commands notify-service-by-email_text
retain_status_information 1
retain_nonstatus_information 1
register 0


=======host
define host {
host_name ma-rd-1133.bose.com
use ht_cis_io_ops_ma_iPad_prod
address x.x.x.x
hostgroups EndPoint Devices - FHM,EndPoint Devices - FHM - RND,hgt_stacked_endpoint_iPad
register 1
}



=========contact group

define contactgroup {
contactgroup_name CIS-IO-SDM-FS-Tier1_email
alias CIS-IO-SDM-FS-Tier1_email
members am43551_CIS_IO_SDM_FS_email,dcops_CIS_IO_OPS_OCC_email_page,dn9576_CIS_IO_SDM_FS_email,gb1010343_CIS_IO_SDM_FS_email,jc71300_CIS_IO_SDM_FS_ema
il,sa1009249_CIS_IO_SDM_FS_email,tw3489_CIS_IO_SDM_FS_email,wj1013103_CIS_IO_SDM_FS_email
}
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Escalation not working correctly

Post by tgriep »

What version of XI is the server running?
How long was the Host Down for?
Are all of the Escalations for that configuration not working or is it certain ones that are not getting generated?
Be sure to check out our Knowledgebase for helpful articles and solutions!
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: Escalation not working correctly

Post by bosecorp »

I am running 5.3.3

I have waited for 30 minutes and nothing, but also I have seen devices down for couple days and still nothing.

Only certain escalation dont seem to be working
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Escalation not working correctly

Post by tgriep »

Can you post one that is working so we can compare it?
Also, check the objects.cache file to see if the Escalation settings looks correct.

Make sure you do not have multiple copies of nagios running so run the following as root to stop them all and start them up.

Code: Select all

service nagios stop
killall -9 nagios
service nagios start
After this, see if the Escalation starts to function.
Be sure to check out our Knowledgebase for helpful articles and solutions!
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: Escalation not working correctly

Post by bosecorp »

I dont see the host in the objects.cache file.
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Escalation not working correctly

Post by tgriep »

That's strange, can you check to see if the host is still active or it it still exists in the Core Config Manager?
If the host does not exist in the objects.cache file, it would not be able to generate any notifications at all.
Be sure to check out our Knowledgebase for helpful articles and solutions!
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: Escalation not working correctly

Post by bosecorp »

it's active in the config manager
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: Escalation not working correctly

Post by bosecorp »

nevermind, I was looking at the wrong file. Forgot that I moved the object.cache file to ramdisk.

I also tried restarting nagios
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Re: Escalation not working correctly

Post by bosecorp »

One thing to note, is that when I go notifications, I so see notifications getting send out. but only for whatever is defined on the host itself. But when it comes to escalation, escalation dont seem to take over
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Escalation not working correctly

Post by tgriep »

I setup a XI 5.3.3 server to test out Host Escalations.
I setup the system as close as to your examples that you provided and I did get the server to send out a Escalated Host Problem.
I am still in the processes of testing but so far, it seems to be working.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked