Think I was able to get it fixed.
Turned out / only had 1% of space left, even though the upgrade did not warn of space issues.. might not have been though..
The Sudoers file issue is still an issue as Redhat is switching to /etc/sudoers.d/username versus /etc/sudoers - unix admin put this server back but still need to work around this if you can.
What I did to get everything back to normal:
Did some research and looking at nagios scripts and found the upgrade_to_latest.sh. Looked it over and decided "why not at this point".
I downloaded xi-latest tarball and put it in tmp and then executed "upgrade_to_latest.sh -t -f /tmp/xi-latestxxxx
This is how it ran
Code: Select all
---- Starting Nagios XI Backup ----
Backing up Core Config Manager (NagiosQL)...
tar: Removing leading `/' from member names
tar: Removing leading `/' from member names
Backing up Nagios Core...
tar: Removing leading `/' from member names
tar: /usr/local/nagios/var/ndo.sock: socket ignored
tar: /usr/local/nagios/var/rw/nagios.qh: socket ignored
Backing up Nagios XI...
tar: Removing leading `/' from member names
Backing up MRTG...
tar: Removing leading `/' from member names
Backing up MySQL databases...
Backing up PostgresQL databases...
Backing up logrotate config files...
Backing up Apache config files...
Compressing backup...
===============
BACKUP COMPLETE
===============
Backup stored in /store/backups/nagiosxi/autoupgrade_backup..tar.gz
---- Starting Nagios XI Upgrade ----
Cleaning up temp directory...
Downloading Latest Nagios XI Tarball...
And that's it. Then just said Upgrade completed successfully. Now when I looked in the upgrade directory there is no entry for todays date so sure that is due to this rough process to get back on track.
Assuming that the first run from the gui did most of the work and either because of the sudoers issues, or something else weird, caused the "nagiosql login failed!" from my first post. This might explain why the log above just shows Downloading Latest XI Tarball, even though I had it in /tmp and pointed the script there.
Anyway, I did a quick check.. The Host creation where you select hostgroups and contacts, the boxes are bigger. Did not notice anything differnt in Metrics but I need to look over the changelog again.
In your opinion did I get through it ok should I be in good shape? All checks are running and graphing... everything seems to be working.
Anything you can think of that I should be checking to make sure something did not get upgraded properly?
****UPDATE****
Added searchable host field in graph explorer -JO -- not seeing this in Graph Explorer. See it in Host Graphs though. Am I looking in the wrong place?
Also in the "Scalable Performance Graph" when I select the 4 hour timeframe, it goes all wonky.. In the screenshot below you will see it has Jan 1 in it. All other timeframes work properly.
You do not have the required permissions to view the files attached to this post.