No recovery sent

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
comfone
Posts: 127
Joined: Fri May 01, 2015 3:28 am

No recovery sent

Post by comfone »

Dear Nagios Team

Sometimes Nagios XI fails to send a recovery notification. An example from yesterday.

Status
pic1.png
Notifications
pic2.png
Configuration
pic4.png
You do not have the required permissions to view the files attached to this post.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: No recovery sent

Post by benjaminsmith »

Hi @comfone

Just to clarify sometime, is this intermittent? Have you received recovery notification for this service in the past or is it consistent behavior? There are a number of layers to controlling notifications in Nagios XI, host level, contact level, and also within the XI user account. Please double-check the service object in the CCM to make sure you have recovery notifications enabled.

Additionally, if the contact is also an XI user (has access to the web interface), check the notification preferences.
notification-preferences.png
See: Nagios XI - Notification Problems

A handy way to check the object settings in XI is to open the objects.cache file and search for the object.

Code: Select all

/usr/local/nagios/var/objects.cache
You do not have the required permissions to view the files attached to this post.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
comfone
Posts: 127
Joined: Fri May 01, 2015 3:28 am

Re: No recovery sent

Post by comfone »

Hi Benjamin

With "sometimes" I mean, that for the same service on the same host and the same recipient the recovery message is not always sent. I also didn't make any config changes.

Following Screenshot shows this.

- The panel at the top shows the Notifications for the Service. There was only one recovery sent
- The panel at the bottom shows the State changes for the Service. There were two "Hard" recoveries.
dump.png
Kind Regards

Urs
You do not have the required permissions to view the files attached to this post.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: No recovery sent

Post by benjaminsmith »

Hi Urs,

Got it, that makes sense. One difference is that it's possible that on 12-7, flap detection may have been initiated ( suppressing the notifications) but I'd have to sort through the nagios log from that day to further troubleshoot.

Can you upload the nagios log from 12-7-20? It would likely be the last entry titled nagios-12-08..*.log in the archives folder by now (rotated every 24 hours).

Code: Select all

/usr/local/nagios/var/archives
Also, please send me the system profile so I can review the configs as well. Thanks, Benjamin

To send us your system profile.
Login to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" button
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
comfone
Posts: 127
Joined: Fri May 01, 2015 3:28 am

Re: No recovery sent

Post by comfone »

Hi Benjamin

Will do. Is there a way to share this with you privately. I don't feel comfortable to share this with everyone. Especially the log file.

Kind Regards

Urs
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: No recovery sent

Post by benjaminsmith »

Hi,

Yeah, no problem. If you click the PM icon under my name, you can send me a Private Message and attach the logs.

Otherwise, you can open a support ticket for this issue and your information will be private between you and the support team.

Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
comfone
Posts: 127
Joined: Fri May 01, 2015 3:28 am

Re: No recovery sent

Post by comfone »

Hi Benjamin

I've sent you a private message. Any news on this?

Kind Regards

Urs
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: No recovery sent

Post by benjaminsmith »

Hi,

Thanks for the system profile. The host was in HARD down state when the service recovered, and under this condition, service notifications are suppressed in Nagios Core (monitoring engine)
[1607376345] HOST ALERT: csvmo071;DOWN;HARD;1;CRITICAL: No Data received for Host. Host might be down
[1607376422] SERVICE ALERT: csvmo071;SSG-CHECF-PROBE-RUN;OK;HARD;1;alive
If the host is in a hard non-OK state, notifications for services related to the host won't be sent out, this is actually true for soft down as well. For a closer look at state types and notifications, the Nagios Core docs are very helpful.

State Types
Notifications

Let me know if you have any questions or need clarification on anything.

Best Regards,
Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
comfone
Posts: 127
Joined: Fri May 01, 2015 3:28 am

Re: No recovery sent

Post by comfone »

Hi Benjamin

Okay I see. So this is a feature not a bug :D. This behaviour probably makes sense for a lot of customer. Unfortunately not for us. Host Recover and Service Recovery are sent to different people. So our NOC gets a service down but never receives a service recovery.

Any way to change this behaviour?

Kind Regards

Urs
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: No recovery sent

Post by benjaminsmith »

Hi Urs,

Typically if the service is not going to recover if the host is down, and suppressing the service notifications when the host is down helps reduce the number of unwanted notifications.

One option is to re-structure the host and service check commands to try and avoid this outcome. These are passive checks, so I am not sure exactly what the check commands are.

Regards,
Benjamin
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked