Nagios XI 5.7.4 alert notification issues

Posted: Thu Nov 05, 2020 12:50 pm
by styl42
We recently upgraded to 5.7.4 and are experiencing strange behaviour with the email notifications.

Hosts that haven't gone down are sending DOWN notifications, while hosts that I've purposely taken offline are shown as down in the Nagios UI but are not sending email notifications.

Re: Nagios XI 5.7.4 alert notification issues

Posted: Thu Nov 05, 2020 3:57 pm
by benjaminsmith
Hi @styl42,

Welcome to the Customer Support Forum and thank you for using Nagios.

I'd like to get a system profile to review the logs. Also, please let me know the exact names of the hosts that are sending notifications without being down, so I can find them in the logs.

Lastly, if you're using PHPMailer, go ahead and enable logging to help troubleshoot.
phpmailer-logging.png
Thanks, Benjamin

To send us your system profile:
1. Log in to the Nagios XI GUI using a web browser.
2. Click "Admin" > "System Profile".
3. Click the "Download Profile" button.
4. Save the profile.zip file and share it in a private message or upload it to the post/ticket, then reply to this post to bring it up in the queue.

Re: Nagios XI 5.7.4 alert notification issues

Posted: Thu Nov 05, 2020 7:31 pm
by styl42
Not sure how to send you a PM, but it's attached here.

Logging was already enabled.

PAR-WH1TEMPCORE-SW1 is the node I was testing with. I received a down notification at 18:24, but the switch did not go offline.

Image

^^ These are the hosts that were reported overnight that never actually went down.

Thank you!

Moderator's Note: The profile has been shared with the support team but has been removed from the public forum.

Re: Nagios XI 5.7.4 alert notification issues

Posted: Thu Nov 05, 2020 9:45 pm
by styl42
PHPMailer logs were already enabled.

System profile is attached; I couldn't find a way to message it to you directly.


PARIAN-WH1TEMPCORE-SW1 is the host I was testing with. I received a down notification at 18:28 but the site was still online.

The image below shows all of our hosts that were falsely reported as down overnight:

Image

^^ All sites were online but were reported offline.

Re: Nagios XI 5.7.4 alert notification issues

Posted: Fri Nov 06, 2020 11:52 am
by benjaminsmith
Hi,

Thank you for the profile. Looking at the Nagios log, the state history table has crashed, and that is preventing updates to the GUI.

Code:

[1604621947] NDO-3: Unable to prepare statement for query (36): Table './nagios/nagios_statehistory' is marked as crashed and last (automatic?) repair failed
[1604621947] NDO-3: Error preparing statements
[1604621947] NDO-3: ndo_handle_statechange(ndo-handlers.c:1289): Could not reconnect to MySQL database
Open a terminal on the server and run the following command as root:

Code:

/usr/local/nagiosxi/scripts/repair_databases.sh
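
To see which tables the log is complaining about, before or after running the repair, the "marked as crashed" entries can be pulled out of the Nagios log. This is only a minimal sketch and not part of Nagios XI: the function name list_crashed_tables is made up for illustration, and the sample input is the NDO error quoted earlier in this thread.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Nagios XI): read Nagios log lines
# on stdin and print each table reported as crashed, once per table.
list_crashed_tables() {
    sed -n "s/.*Table '\([^']*\)' is marked as crashed.*/\1/p" | sort -u
}

# Demo using the log lines quoted in this thread.
list_crashed_tables <<'EOF'
[1604621947] NDO-3: Unable to prepare statement for query (36): Table './nagios/nagios_statehistory' is marked as crashed and last (automatic?) repair failed
[1604621947] NDO-3: Error preparing statements
EOF
```

On a live server the same function could read the real log, for example `list_crashed_tables < /usr/local/nagios/var/nagios.log`, assuming that is where your install writes it. If the function prints nothing after the repair and a restart, the log no longer contains crashed-table messages of this form.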
After the repair completes, force an immediate check on a host and let me know whether the results update as expected.
force-immediate-check.png
In this situation, the notifications are probably right, since the Nagios process is running without issue. I'm seeing a number of failing ping checks, which suggests network connectivity issues. Are you certain there are no issues there?

Code:

[1604620650] HOST NOTIFICATION: pmoljgun;498Rickis;DOWN;xi_host_notification_handler;CRITICAL - 10.223.243.1: rta nan, lost 100%
[1604620650] HOST ALERT: 498Rickis;DOWN;HARD;10;CRITICAL - 10.223.243.1: rta nan, lost 100%
Benjamin
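
For anyone comparing notification emails against what Nagios actually logged, the HOST ALERT and HOST NOTIFICATION entries quoted above are semicolon-delimited, so they can be reduced to a short per-host summary. The following is a rough sketch under that assumption: summarize_host_events is a hypothetical name, not a Nagios tool, and it assumes host names contain no semicolons or spaces.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Nagios XI): read Nagios log lines
# on stdin and print "epoch kind host state" for host alerts and
# host notifications. All other lines are ignored.
summarize_host_events() {
    awk -F';' '
        /^\[[0-9]+\] HOST (ALERT|NOTIFICATION): / {
            # The epoch timestamp sits between the leading brackets.
            epoch = $1
            sub(/^\[/, "", epoch)
            sub(/\].*$/, "", epoch)
            # Text after "TYPE: " in the first field.
            first = $1
            sub(/^[^:]*: /, "", first)
            if ($1 ~ /HOST NOTIFICATION: /) {
                # Fields: contact;host;state;command;output
                print epoch, "NOTIFICATION", $2, $3
            } else {
                # Fields: host;state;state type;attempt;output
                print epoch, "ALERT", first, $2
            }
        }'
}

# Demo using the two log lines quoted in this thread.
summarize_host_events <<'EOF'
[1604620650] HOST NOTIFICATION: pmoljgun;498Rickis;DOWN;xi_host_notification_handler;CRITICAL - 10.223.243.1: rta nan, lost 100%
[1604620650] HOST ALERT: 498Rickis;DOWN;HARD;10;CRITICAL - 10.223.243.1: rta nan, lost 100%
EOF
```

Each output line starts with the raw epoch timestamp from the log, which can then be converted to local time and matched against the time a notification email arrived. A notification with no matching alert for the same host, or the reverse, would be worth a closer look.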