Nagios XI sending out false alerts after server migration

tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Post by tgriep »

This error
DETAIL: Process 3646 waits for ShareUpdateExclusiveLock on relation 16424 of database 16385.
is from the PostgreSQL service, which thinks a process is accessing the database even though that process may no longer be running.
The only way to resolve that is to reboot the server.
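
To see what PostgreSQL itself reports for that lock, something like the following could be used. This is only a rough sketch: parse_lock_detail and lock_query are made-up helper names, and the query assumes the standard pg_locks view with its pid, mode, granted, relation and database columns.

```shell
#!/bin/sh
# Sketch only: parse_lock_detail and lock_query are hypothetical helpers.

# Extract "PID RELATION_OID DATABASE_OID" from a lock-wait DETAIL line.
parse_lock_detail() {
    printf '%s\n' "$1" | sed -n \
        's/.*Process \([0-9]*\) waits for [A-Za-z]* on relation \([0-9]*\) of database \([0-9]*\).*/\1 \2 \3/p'
}

# Build a query listing every backend that holds or waits for a lock
# on the given relation OID ($1) in the given database OID ($2).
lock_query() {
    printf 'SELECT pid, mode, granted FROM pg_locks WHERE relation = %s AND database = %s;\n' "$1" "$2"
}

detail='DETAIL: Process 3646 waits for ShareUpdateExclusiveLock on relation 16424 of database 16385.'
set -- $(parse_lock_detail "$detail")
echo "waiting pid=$1 relation=$2 database=$3"
lock_query "$2" "$3"
```

The generated query can be piped to psql, for example lock_query 16424 16385 | psql nagiosxi nagiosxi (using that database and user here is an assumption). Rows with granted = f are waiting and rows with granted = t are holding the lock. Running ps -p 3646 shows whether the waiting process still exists at the OS level.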

This message
LOG: checkpoints are occurring too frequently (4 seconds apart)
HINT: Consider increasing the configuration parameter "checkpoint_segments".
refers to the checkpoint_segments setting in the PostgreSQL configuration file:

Code:

/var/lib/pgsql/data/postgresql.conf
Edit the file, increase checkpoint_segments, and restart PostgreSQL. Repeat until the message goes away.
It might be caused by the share lock issue, so you could wait until the lock is fixed.
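
As a rough illustration of that edit, here is a minimal sketch. It assumes the file still contains the stock line, which is normally commented out as #checkpoint_segments = 3. The helper name set_checkpoint_segments is made up, and the value 10 used in the example is arbitrary, not a recommendation.

```shell
#!/bin/sh
# Sketch only: set_checkpoint_segments is a hypothetical helper.
# It keeps a backup as FILE.bak, then rewrites (and uncomments) the
# checkpoint_segments line. If no such line exists, nothing is changed.
set_checkpoint_segments() {
    conf=$1
    value=$2
    cp -p "$conf" "$conf.bak" || return 1
    sed "s/^[#[:space:]]*checkpoint_segments[[:space:]]*=[[:space:]]*[0-9]*/checkpoint_segments = $value/" \
        "$conf.bak" > "$conf"
}

# Example (run as root, then restart PostgreSQL):
#   set_checkpoint_segments /var/lib/pgsql/data/postgresql.conf 10
#   service postgresql restart
```

Writing the result back into the existing file, instead of replacing it, keeps the file's owner and permissions as they were.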
krobertson71
Posts: 444
Joined: Tue Feb 11, 2014 10:16 pm

Post by krobertson71 »

Would this cause the random false errors and the PHP session timing out in the GUI after only a couple of minutes?

I already checked php.ini, and the timeout is set to 1440.
krobertson71
Posts: 444
Joined: Tue Feb 11, 2014 10:16 pm

Post by krobertson71 »

Also, the server was rebooted a few times yesterday during testing, so a reboot did not resolve the issue.
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Post by tgriep »

Yes, the server having trouble connecting to the PostgreSQL database could cause the issues you are seeing, since it cannot update the data.
Try the following procedure to clear out the PostgreSQL tables and fix the lock.

Code:

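# Stop Nagios, the NDO database writer, and cron, then restart PostgreSQL
service nagios stop
service ndo2db stop
service crond stop
service postgresql restart
# Kill any processes still running as the nagios user
pkill -9 -u nagios
# Empty the xi_events, xi_meta and xi_eventqueue tables in the nagiosxi database
echo "truncate table xi_events; truncate table xi_meta; truncate table xi_eventqueue;" | psql nagiosxi nagiosxi
# Start everything again
service crond start
service ndo2db start
service nagios start
service npcd restart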
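
Once the commands above have finished, a quick way to confirm that everything came back up might look like this. It is only a sketch: check_services is a made-up helper, and it relies on "service NAME status" exiting non-zero when a service is not running, which is the usual init-script convention but is an assumption here.

```shell
#!/bin/sh
# Sketch only: check_services is a hypothetical helper.
# Prints one line per service and returns 1 if any of them is not running.
check_services() {
    failed=0
    for svc in "$@"; do
        if service "$svc" status >/dev/null 2>&1; then
            echo "$svc: running"
        else
            echo "$svc: NOT running"
            failed=1
        fi
    done
    return "$failed"
}

# Example:
#   check_services postgresql crond ndo2db nagios npcd
```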
Give the system a few minutes to run, and if the issue is still there, post the following files.

Code: Select all

/var/lib/pgsql/data/postgresql.conf
/etc/php.ini
/etc/security/limits.conf
Then run this and post the output.

Code:

tail -n 100 /var/lib/pgsql/data/pg_log/postgresql-Thu.log
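
To get a quick feel for how often the two messages from this thread appear in that log, a small counter along these lines could help. It is a sketch only: summarize_pg_log is a made-up helper name, and the patterns are taken from the log lines quoted earlier in this thread.

```shell
#!/bin/sh
# Sketch only: summarize_pg_log is a hypothetical helper.
# Counts the checkpoint warnings and lock-wait lines in a PostgreSQL log file.
summarize_pg_log() {
    log=$1
    printf 'checkpoint warnings: %s\n' \
        "$(grep -c 'checkpoints are occurring too frequently' "$log")"
    printf 'lock waits: %s\n' \
        "$(grep -c 'waits for .*Lock on relation' "$log")"
}

# Example:
#   summarize_pg_log /var/lib/pgsql/data/pg_log/postgresql-Thu.log
```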