Monitoring Engine won't start after upgrade
- sathevaner
- Posts: 62
- Joined: Wed Nov 29, 2017 11:00 am
Monitoring Engine won't start after upgrade
I just upgraded to the latest version of Nagios, manually on the command line, and the monitoring engine refuses to start. The web UI upgrade procedure was left in a hung "Update in progress. Please wait. Update may take a few minutes," Upon reporting that it is running 5.6.3, it is still stuck in that state.
Please advise on next steps for troubleshooting.
Please advise on next steps for troubleshooting.
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Monitoring Engine won't start after upgrade
Hi @sathevaner,
There's a known issue with the web upgrade stalling on occasion. Please follow the guide to reset the upgrade status.
Nagios XI - Reset Upgrade Status In Web Interface
Then restart Nagios:
Let me know if the Monitoring Engine status is green again.
There's a known issue with the web upgrade stalling on occasion. Please follow the guide to reset the upgrade status.
Nagios XI - Reset Upgrade Status In Web Interface
Then restart Nagios:
Code: Select all
systemctl restart nagios
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!
- sathevaner
- Posts: 62
- Joined: Wed Nov 29, 2017 11:00 am
Re: Monitoring Engine won't start after upgrade
Hello, @benjaminsmith,
Thank you for the manual reset instructions for the database backend. This did not resolve the Monitoring Engine issue, as it is still refusing to start. I am unclear on why it will not do so after the manual upgrade.
Thank you for the manual reset instructions for the database backend. This did not resolve the Monitoring Engine issue, as it is still refusing to start. I am unclear on why it will not do so after the manual upgrade.
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Monitoring Engine won't start after upgrade
Hello,
Is the server showing Nagios 5.6.3 in the lower left side of the screen? Let's try stopping everything and then re-start the whole Nagios stack. Please run the following from the terminal:
If it does not come back up, can you PM your system profile so we can take a closer look at the logs. Thanks.
To send us your system profile.
Login to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" button
Save the profile.zip file and share in a private message or upload it to the post/ticket.
Is the server showing Nagios 5.6.3 in the lower left side of the screen? Let's try stopping everything and then re-start the whole Nagios stack. Please run the following from the terminal:
Code: Select all
systemctl stop crond
systemctl stop npcd
systemctl stop nagios
systemctl stop ndo2db
pkill -9 -u nagios
rm -rf /usr/local/nagiosxi/var/dbmaint.lock
rm -rf /usr/local/nagiosxi/var/event_handler.lock
rm -rf /usr/local/nagiosxi/scripts/reconfigure_nagios.lock
systemctl restart mysqld || systemctl restart mariadb
systemctl start ndo2db
systemctl start nagios
systemctl start npcd
systemctl start crond
systemctl restart httpd
To send us your system profile.
Login to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" button
Save the profile.zip file and share in a private message or upload it to the post/ticket.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!
- sathevaner
- Posts: 62
- Joined: Wed Nov 29, 2017 11:00 am
Re: Monitoring Engine won't start after upgrade
Restarting all the relevant services did not result in the Monitoring Engine starting. Attached is the System Profile.
You do not have the required permissions to view the files attached to this post.
Re: Monitoring Engine won't start after upgrade
This is why, it's because your configuration is invalid:
You can validate with:
Please go to Configure > Core Config Manager > Services:
- Edit that service and make sure all these are set:
- Save
Then go to Configure > Core Config Manager > Tools > Config File Management:
- Click the Delete Files button (don't worry, it's safe, they will be rewritten)
- Then click the Write Configs button
- Then click the Verify Files button, if it verifies properly, try to start the nagios service from the CLI.
- If it doesn't verify, keep fixing the errors it shows until it does.
Code: Select all
Error: Invalid max_attempts, check_interval, retry_interval, or notification_interval value for service 'Check status of APC battery' on host 'APC-UPS-ITEC-043'
Error: Could not register service (config file '/usr/local/nagios/etc/services/APC-UPS.cfg', starting on line 16)Code: Select all
/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg- Edit that service and make sure all these are set:
Code: Select all
Invalid max_attempts, check_interval, retry_interval, notification_intervalThen go to Configure > Core Config Manager > Tools > Config File Management:
- Click the Delete Files button (don't worry, it's safe, they will be rewritten)
- Then click the Write Configs button
- Then click the Verify Files button, if it verifies properly, try to start the nagios service from the CLI.
- If it doesn't verify, keep fixing the errors it shows until it does.
Re: Monitoring Engine won't start after upgrade
Additionally (see previous post): When an apply configuration fails, it reverts the files on the filesystem to the last known-good state so that your monitoring continues to work, so... to get them into a bad state so I can review exactly what the issues is, you need to do the steps below exactly without any steps in between:
Please go to Configure > Core Config Manager > Tools > Config File Management:
- Click the Delete Files button (don't worry, it's safe, they will be rewritten)
- Then click the Write Configs button
- When they are done being written, run this command before doing anything else (don't apply config or anything):
Then attach the resulting /tmp/NAGIOSBADFILES.zip file to the ticket so that I can review it.
Please go to Configure > Core Config Manager > Tools > Config File Management:
- Click the Delete Files button (don't worry, it's safe, they will be rewritten)
- Then click the Write Configs button
- When they are done being written, run this command before doing anything else (don't apply config or anything):
Code: Select all
zip -r /tmp/NAGIOSBADFILES.zip /usr/local/nagios/etc- sathevaner
- Posts: 62
- Joined: Wed Nov 29, 2017 11:00 am
Re: Monitoring Engine won't start after upgrade
Thank you, I was unaware that this was a legit host in our list. I have corrected the issue. This thread may now be closed.
-
benjaminsmith
- Posts: 5324
- Joined: Wed Aug 22, 2018 4:39 pm
- Location: saint paul
Re: Monitoring Engine won't start after upgrade
Hi,
Thanks for the update and glad you got this worked out.Thank you, I was unaware that this was a legit host in our list. I have corrected the issue. This thread may now be closed.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Be sure to check out our Knowledgebase for helpful articles and solutions!