Page 1 of 3

Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 12:07 pm
by tgfde
Hi,

Our prod instance of nagios disappeared.

We were receiving DB error so I restarted nagios and now it is the new install page as if we are doing a new install. Please help.

CentOS 5.6 vmWare
NagiosXI2011 R1.2

Thanks.

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 12:16 pm
by tgfde
Here's the DB message I received before restarting Nagios.

DB Connect Error [nagiosxi]: Database connection failed

Thanks.

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 1:42 pm
by mguthrie
Try:

Code: Select all

service postgresql restart
If postgresql isn't running, then the "installed" flag won't be able to save.

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 1:57 pm
by tgfde
Hi mguthire,

I tried that but did not work, I also restarted the Nagios host (VM).

Nagios is working fine I just can't log on to the UI due to the DB error.

Thanks.

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 2:17 pm
by mguthrie
I tried that but did not work
What kind of output did it display? I still don't know if postgres is even running.

If it is running, see if you can connect to it with:

Code: Select all

psql nagiosxi nagiosxi
Have you made any system or firewall changes recently?

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 2:38 pm
by tgfde
Postgres is running.


psql nagiosxi nagiosxi
psql: FATAL: database is not accepting commands to avoid wraparound data loss in database "postgres"
HINT: Stop the postmaster and use a standalone backend to vacuum database "postgres".
[root@corpmon6 ~]#
[root@corpmon6 ~]# service nagios status
nagios (pid 3681) is running...
[root@corpmon6 ~]# service postgresql status
postmaster (pid 30106 30105 30104 30102 30100) is running...
[root@corpmon6 ~]# service postgresql restart
Stopping postgresql service: [ OK ]
Starting postgresql service: [ OK ]
[root@corpmon6 ~]#

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 3:05 pm
by tgfde
Sorrry I forgot to answer your question.

No firewall or system changes.

Thanks.

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 4:17 pm
by mguthrie
Check to make sure you didn't run out of hard drive space.

If not, see if it will let you run a vacuum command:

Code: Select all

psql nagiosxi nagiosxi -c 'vacuum;'

Re: Production Instance of Nagios gone

Posted: Mon Jan 02, 2012 6:02 pm
by tgfde
I still have space available and ran the command below.

psql nagiosxi nagiosxi -c 'vacuum;'
psql: FATAL: database is not accepting commands to avoid wraparound data loss in database "postgres"
HINT: Stop the postmaster and use a standalone backend to vacuum database "postgres".


Thanks.

Re: Production Instance of Nagios gone

Posted: Tue Jan 03, 2012 8:34 am
by tgfde
mguthrie,

Could you tell me how to acknowledge a notification from the command line while we troubleshoot this issue? Also, is it possible to do a remote session?

Thanks.