
Nagios issues after upgrade to 5.7.3

Posted: Fri Oct 02, 2020 1:28 am
by scharft
Hi,

Yesterday at 11 am I upgraded XI to 5.7.3, and after the upgrade everything looked fine.
A few hours later, Nagios stopped checking hosts and services for about 1-2 hours.
At 5 pm XI stopped running passive checks, and submitting passive check results had no effect.

The system status and the monitoring engine looked good.

Re: Nagios issues after upgrade to 5.7.3

Posted: Fri Oct 02, 2020 4:26 pm
by benjaminsmith
Hi @scharft,

If there was an issue with the database before upgrading it could have an effect on database writes, so let's open the terminal and run the following command as root:

Code:

/usr/local/nagiosxi/scripts/repair_databases.sh

If that does not resolve the issue and this is a production monitoring system (not your test instance), please follow the steps below to downgrade the backend database to ndo2db.

Code:

systemctl stop nagios
cd /tmp
rm -rf /tmp/nagiosxi
wget https://assets.nagios.com/downloads/nagiosxi/5/xi-5.6.14.tar.gz
tar zxf xi-5.6.14.tar.gz
cd /tmp/nagiosxi/subcomponents/ndoutils
./install
systemctl enable ndo2db

Then edit your /usr/local/nagios/etc/nagios.cfg and make sure this line is uncommented:

Code:

broker_module=/usr/local/nagios/bin/ndomod.o config_file=/usr/local/nagios/etc/ndomod.cfg

Make sure this line is commented out:

Code:

#broker_module=/usr/local/nagios/bin/ndo.so /usr/local/nagios/etc/ndo.cfg

Then start the nagios service:

Code:

systemctl start nagios
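
Before starting the service, it may help to confirm which broker module nagios.cfg would actually load. The following is only a rough sketch: the function names active_broker_modules and ndo_backend are made up for illustration, and the default path is the nagios.cfg location mentioned above.

```shell
#!/bin/sh
# Hypothetical helpers (not part of Nagios XI): inspect the broker_module
# lines in a Nagios main config file.

# Print every uncommented broker_module line.
# Usage: active_broker_modules [path-to-nagios.cfg]
active_broker_modules() {
    cfg="${1:-/usr/local/nagios/etc/nagios.cfg}"
    # Lines beginning with '#' do not match the anchored pattern.
    # '|| true' keeps the function from failing when nothing matches.
    grep -E '^[[:space:]]*broker_module=' "$cfg" || true
}

# Report which NDO broker module is enabled:
# "ndomod" (the ndo2db backend), "ndo" (ndo.so), "both", or "none".
ndo_backend() {
    mods=$(active_broker_modules "$1")
    case "$mods" in
        *ndomod.o*ndo.so*|*ndo.so*ndomod.o*) echo "both" ;;
        *ndomod.o*) echo "ndomod" ;;
        *ndo.so*)   echo "ndo" ;;
        *)          echo "none" ;;
    esac
}
```

After the downgrade steps above, running ndo_backend with no argument should print "ndomod". A result of "both" would suggest the ndo.so line was left uncommented.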

Otherwise, send us your system profile to help troubleshoot this further.

Thanks,
Benjamin

To send us your system profile:
Log in to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" menu.
Click the "Download Profile" button.
Save the profile.zip file and share it in a private message or upload it to the post/ticket, then reply to this post to bring it up in the queue.

Re: Nagios issues after upgrade to 5.7.3

Posted: Wed Oct 07, 2020 8:30 am
by scharft
The checks need approximately 60 minutes to start after a reboot of the instance or after we save the settings.
The profile is attached.
If possible, I would rather not downgrade the production system to ndo2db.

thank you

Re: Nagios issues after upgrade to 5.7.3

Posted: Wed Oct 07, 2020 5:25 pm
by benjaminsmith
Hi scharft,

Sorry about the inconvenience, but could you re-attach or PM the system profile once more?

Also, please try to run the repair script, and let me know if you notice any improvement. A crashed database table can impact status updates.

Code:

/usr/local/nagiosxi/scripts/repair_databases.sh

Thanks, Benjamin

Re: Nagios issues after upgrade to 5.7.3

Posted: Fri Oct 09, 2020 1:43 am
by scharft
Hi

Repairing the database did not get the active and passive checks running again.
After restarting the monitoring engine (e.g. via Apply Configuration), ALL active and passive checks stop working for about 30 minutes.
I forced an immediate check several times without success.
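
One way to put a number on the stall is to count how many host and service entries in the Nagios status file have an old last_check timestamp. This is a minimal sketch: stale_checks is a made-up helper name, and the status file path depends on the status_file setting in nagios.cfg, so the path in the usage line is an assumption.

```shell
#!/bin/sh
# Hypothetical helper: count hoststatus/servicestatus entries in a Nagios
# status file whose last_check is older than max_age seconds.
# Entries that were never checked (last_check=0) count as stale.
# Usage: stale_checks <status_file> <max_age_seconds> [now_epoch]
stale_checks() {
    status_file="$1"
    max_age="$2"
    now="${3:-$(date +%s)}"
    awk -v now="$now" -v max_age="$max_age" '
        # A host or service status block starts here.
        /^(hoststatus|servicestatus) [{]/ { in_block = 1; next }
        # Inside a block, compare last_check against the current time.
        in_block && /^[ \t]*last_check=/ {
            split($0, kv, "=")
            total++
            if (now - kv[2] > max_age) stale++
        }
        # A closing brace ends the current block.
        /^[ \t]*[}]/ { in_block = 0 }
        END { printf "%d of %d checks older than %d seconds\n", stale, total, max_age }
    ' "$status_file"
}
```

For example, stale_checks /usr/local/nagios/var/status.dat 1800 would report how many entries have not been checked in the last 30 minutes, assuming the status file lives at that path.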


best

Moderator's Note: The profile has been shared with the support team but has been removed from the public forum.

Re: Nagios issues after upgrade to 5.7.3

Posted: Fri Oct 09, 2020 1:21 pm
by benjaminsmith
Hi,

Thanks for the profile. I understand why you do not want to downgrade the production system, but I think it may be the best short-term resolution since I'm not seeing any NDO-related errors in the Nagios log. I would recommend keeping this system on the downgraded version of NDO until you decide to migrate to CentOS 7 or 8 in the near future (CentOS 6 will be EOL on 11/30).
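
To look for NDO-related messages yourself, a simple search of the Nagios log is one option. The sketch below is illustrative only: ndo_log_lines is a made-up helper name, the default log path is an assumption (check the log_file setting in nagios.cfg), and matching on the string "ndo" is a rough filter rather than an exact list of error messages.

```shell
#!/bin/sh
# Hypothetical helper: print the most recent log lines that mention NDO,
# matched case-insensitively.
# Usage: ndo_log_lines [log_file] [max_lines]
ndo_log_lines() {
    log="${1:-/usr/local/nagios/var/nagios.log}"
    max="${2:-20}"
    # grep selects matching lines; tail keeps only the newest ones.
    grep -i 'ndo' "$log" | tail -n "$max"
}
```

Running ndo_log_lines with no arguments prints up to 20 matching lines from the assumed default log. Empty output means nothing in that file mentions NDO.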

We do have a new version coming out soon with bug fixes and improved logging for the new database backend if you're able to hold off on the downgrade.

Let us know what you decide.

Also, if you want to test anything out before making changes to the production system, each Nagios XI license allows for 3 activations: production, test, and backup.

Best Regards,
Benjamin