Notifications not sent for down host!!

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
pspagnola
Posts: 51
Joined: Thu Nov 21, 2019 4:40 pm

Notifications not sent for down host!!

Post by pspagnola »

- a server was down for hours.
- event log shows nagios saw everything.
- notification log shows no notification was ever sent.
- I am using host escalation policies. using host filter "*" so it is not a real complex profile.
- I do receive other notifications so i know emails are working.
- This is a very important server!

where can I find logs to figure what went wrong?
Nagios v5.6.12 ~200 hosts 500+ services.
pspagnola
Posts: 51
Joined: Thu Nov 21, 2019 4:40 pm

Re: Notifications not sent for down host!!

Post by pspagnola »

I found this configuration info inside the objects.cache file. I created a fake outage right now, still no alerts.



define host {
host_name --server name here--
display_name -redacted-
alias --server name here--
address -redacted-
check_period xi_timeperiod_24x7
check_command check_xi_host_ping_timeout!1200.0!15%!1850.0!35%!2!!!
contacts nagiosadmin
contact_groups Infrastructure Team
notification_period xi_timeperiod_24x7
initial_state o
importance 0
check_interval 1.000000
retry_interval 1.000000
max_check_attempts 3
active_checks_enabled 1
passive_checks_enabled 1
obsess 1
event_handler_enabled 1
low_flap_threshold 0.000000
high_flap_threshold 0.000000
flap_detection_enabled 1
flap_detection_options a
freshness_threshold 0
check_freshness 0
notification_options r,d
notifications_enabled 1
notification_interval 60.000000
first_notification_delay 0.000000
stalking_options n
process_perf_data 1
icon_image network_node.png
statusmap_image network_node.png
retain_status_information 1
retain_nonstatus_information 1
_XIWIZARD genericnetdevice
}


define hostescalation {
host_name --server name here--
first_notification 1
last_notification 1
notification_interval 7.000000
escalation_period 24x7
escalation_options r,d,u
contact_groups Service Desk Team,Management Team,Infrastructure Team,Development Team
}

define hostescalation {
host_name --server name here--
first_notification 2
last_notification 3
notification_interval 60.000000
escalation_period 24x7
escalation_options r,d,u
contact_groups Management Team,Infrastructure Team - SMS,Infrastructure Team
}

define hostescalation {
host_name --server name here--
first_notification 4
last_notification 0
notification_interval 60.000000
escalation_period 24x7
escalation_options r,d,u
contact_groups Management Team - SMS,Management Team,Infrastructure Team - SMS,Infrastructure Team
}
Nagios v5.6.12 ~200 hosts 500+ services.
pspagnola
Posts: 51
Joined: Thu Nov 21, 2019 4:40 pm

Re: Notifications not sent for down host!!

Post by pspagnola »

I have disabled all the escalation profiles and I am just trying normal notification logic with templates.

The objects.cache file shows all the correct things were inherited from templates, but still no notification in the notifications in the notifications log page. I did an objects.cache host config compare from a host that does have notifications working. nothing relating to notifications is different.
Nagios v5.6.12 ~200 hosts 500+ services.
pspagnola
Posts: 51
Joined: Thu Nov 21, 2019 4:40 pm

Re: Notifications not sent for down host!!

Post by pspagnola »

I deleted the only host dependency rule I had created and now it all works again. Not using any of those rules in the future until I understand what went wrong and do lots more testing.

I had a group of windows servers dependent on a group of ESX servers.
Nagios v5.6.12 ~200 hosts 500+ services.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Notifications not sent for down host!!

Post by scottwilkerson »

pspagnola wrote:I deleted the only host dependency rule I had created and now it all works again. Not using any of those rules in the future until I understand what went wrong and do lots more testing.

I had a group of windows servers dependent on a group of ESX servers.
Glad to see you got it sorted out

Locking thread
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked