Thanks - results & comments below.
Summary: still no change - still seeing the same issue.
/usr/local/nagiosxi/scripts/reset_config_perms.sh
I've done that many times previously (also mentioned in OP), and doing it again didn't change anything.
Done - no change. FYI I had already mentioned looking in
/var/log/secure for any
sudo failures and there are none.
Here's a tip... after you download the XI install tarball and extract the
nagiosxi.sudoers you can easily compare it to what's live in /etc/sudoers with this:
Code: Select all
# grep NAGIOS /tmp/nagiosxi.sudoers | sha256sum
02bbaec27094faad6d4aa9ed633500e2504ca4bc1a00c31b786c9c32e25dad08 -
# grep NAGIOS /etc/sudoers | sha256sum
02bbaec27094faad6d4aa9ed633500e2504ca4bc1a00c31b786c9c32e25dad08 -
That's safer than manipulating the live /etc/sudoers on a running production server.
Also your instructions should include running
visudo -c at the end, to make sure there's no typos.
Finally, run the following commands:
Code: Select all
service crond stop
service npcd stop
service nagios stop
service ndo2db stop
pkill -9 -u nagios
for i in $(ipcs -q | grep nagios |awk '{print $2}'); do ipcrm -q $i; done
rm -rf /usr/local/nagiosxi/var/dbmaint.lock
rm -rf /usr/local/nagiosxi/var/event_handler.lock
rm -rf /usr/local/nagiosxi/scripts/reconfigure_nagios.lock
service mysqld restart
service ndo2db start
service nagios start
service npcd start
service crond start
Done - no change. Before performing those steps I did see more than one message queue (there were two with Nagios still running) from
ipcs -q but after removing it still no change in the problem condition.
Please note this MySQL is NOT local, it's on an AWS RDS, therefore I did
not run the
service mysqld restart as you listed. (You should probably add an instruction to those steps so that a novice admin doesn't accidentally start up MySQL on their Nagios server with potentially confusing if not security-related consequences.)
If the instrcutions above don't work run the following command and upload the support.txt file from the /tmp/ folder.
Code: Select all
ls -lR /usr/local/nagios/ > /tmp/support.txt
Since there's still no change I'll PM you that output.