Production Instance of Nagios gone

tgfde
Posts: 233
Joined: Thu May 12, 2011 7:55 am

Hi,

Our production instance of Nagios has disappeared.

We were receiving a DB error, so I restarted Nagios, and now it shows the new-install page as if we were doing a fresh install. Please help.

CentOS 5.6 on VMware
Nagios XI 2011 R1.2

Thanks.
tgfde

Here's the DB message I received before restarting Nagios.

DB Connect Error [nagiosxi]: Database connection failed

Thanks.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Try:

service postgresql restart
If postgresql isn't running, the "installed" flag can't be saved, so XI shows the install page.
tgfde

Hi mguthrie,

I tried that, but it did not work. I also restarted the Nagios host (VM).

Nagios is working fine; I just can't log on to the UI because of the DB error.

Thanks.
mguthrie

"I tried that but did not work"

What kind of output did it display? I still don't know whether postgres is even running.

If it is running, see if you can connect to it with:

psql nagiosxi nagiosxi
Have you made any system or firewall changes recently?
tgfde

Postgres is running.


psql nagiosxi nagiosxi
psql: FATAL: database is not accepting commands to avoid wraparound data loss in database "postgres"
HINT: Stop the postmaster and use a standalone backend to vacuum database "postgres".
[root@corpmon6 ~]#
[root@corpmon6 ~]# service nagios status
nagios (pid 3681) is running...
[root@corpmon6 ~]# service postgresql status
postmaster (pid 30106 30105 30104 30102 30100) is running...
[root@corpmon6 ~]# service postgresql restart
Stopping postgresql service: [ OK ]
Starting postgresql service: [ OK ]
[root@corpmon6 ~]#
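
The HINT in the output above asks for a vacuum from a standalone backend rather than through psql. A rough sketch of what that could look like is below. It only prints the steps instead of running them, and it rests on assumptions not confirmed in this thread: the data directory is the RHEL/CentOS default `/var/lib/pgsql/data`, and the server is PostgreSQL 8.1 (the stock package on CentOS 5), where the standalone backend is started with plain `postgres` rather than `postgres --single`. The function name and its arguments are hypothetical.

```shell
# Print (do not run) the steps for a standalone-backend VACUUM.
# Assumptions, not confirmed in this thread: default RHEL/CentOS data
# directory and a PostgreSQL 8.1 server.
standalone_vacuum_steps() {
  pgdata="${1:-/var/lib/pgsql/data}"   # assumed default, adjust as needed
  dbname="${2:-postgres}"              # database named in the HINT
  pgver="${3:-8.1}"                    # assumed server version
  case "$pgver" in
    # Before 8.2 the "postgres" binary itself is the standalone backend.
    7.*|8.0|8.0.*|8.1|8.1.*) backend="postgres -D $pgdata $dbname" ;;
    # From 8.2 on, single-user mode needs the --single switch.
    *) backend="postgres --single -D $pgdata $dbname" ;;
  esac
  echo "service postgresql stop"
  echo "su - postgres -c '$backend'"
  echo "service postgresql start"
}

standalone_vacuum_steps
```

At the `backend>` prompt opened by the second step, you would type `VACUUM;` and then press Ctrl-D to exit before starting the service again. If the server later complains about another database, the same steps would likely need repeating with that database name as the second argument.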
tgfde

Sorry, I forgot to answer your question.

No firewall or system changes.

Thanks.
mguthrie

Check to make sure you didn't run out of hard drive space.

If there is still free space, see if it will let you run a vacuum command:

psql nagiosxi nagiosxi -c 'vacuum;'
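
For the disk-space check mentioned above, a minimal sketch: the helper below (a hypothetical name) reads `df -P` style output and lists any mount point at or above a threshold, 90% by default. Which filesystem holds the PostgreSQL data is not stated in this thread; on a stock CentOS install it would probably be the one containing `/var/lib/pgsql`.

```shell
# List mount points whose usage is at or above a threshold (default 90%).
# Reads "df -P" style output on stdin: the header line is skipped, column 5
# is the capacity percentage and column 6 the mount point.
full_filesystems() {
  threshold="${1:-90}"
  awk -v t="$threshold" 'NR > 1 {
    use = $5
    sub(/%/, "", use)            # strip the percent sign before comparing
    if (use + 0 >= t) print $6 " " $5
  }'
}

# Check the local system; no output means nothing is above the threshold.
df -P | full_filesystems 90
```

No output from the last line suggests free space is not the problem, which matches what the wraparound message points to.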
tgfde

I still have space available and ran the command below.

psql nagiosxi nagiosxi -c 'vacuum;'
psql: FATAL: database is not accepting commands to avoid wraparound data loss in database "postgres"
HINT: Stop the postmaster and use a standalone backend to vacuum database "postgres".


Thanks.
tgfde

mguthrie,

Could you tell me how to acknowledge a notification from the command line while we troubleshoot this issue? Also, is it possible to do a remote session?

Thanks.
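
On acknowledging from the command line: since the Nagios process itself is still running, one option is to write an `ACKNOWLEDGE_SVC_PROBLEM` external command to its command pipe. A minimal sketch follows, assuming the default command file path `/usr/local/nagios/var/rw/nagios.cmd` (check `command_file` in nagios.cfg). The helper name, host, service, and author values are hypothetical.

```shell
# Build an ACKNOWLEDGE_SVC_PROBLEM line in Nagios external-command format:
# [time] ACKNOWLEDGE_SVC_PROBLEM;host;service;sticky;notify;persistent;author;comment
ack_service_line() {
  # $1 host, $2 service description, $3 author, $4 comment,
  # $5 optional epoch timestamp (defaults to the current time)
  now="${5:-$(date +%s)}"
  # sticky=2 keeps the acknowledgement until the service recovers,
  # notify=1 sends an acknowledgement notification, persistent=1 keeps the comment.
  printf '[%s] ACKNOWLEDGE_SVC_PROBLEM;%s;%s;2;1;1;%s;%s\n' "$now" "$1" "$2" "$3" "$4"
}

# Assumed default command pipe; adjust to match command_file in nagios.cfg.
CMD_FILE=/usr/local/nagios/var/rw/nagios.cmd

# Example with hypothetical names (uncomment to send):
# ack_service_line corpweb1 HTTP tgfde "Investigating DB issue" > "$CMD_FILE"
```

The user running this needs write access to the command pipe. Host problems would use `ACKNOWLEDGE_HOST_PROBLEM`, which takes the same fields without the service description.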
Locked