Page 3 of 3
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 11:42 am
by rferebee
Code: Select all
root@nagiostest:/root>ps -ef|grep nagios.cfg
root 42525 42449 0 09:41 pts/0 00:00:00 grep --color=auto nagios.cfg
nagios 95310 1 1 00:59 ? 00:05:32 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
nagios 95325 95310 0 00:59 ? 00:00:01 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 12:07 pm
by scottwilkerson
I am really at a loss, If you go to the Advanced tab for this service and submit a passive check of OK, does the CRITICAL state return?
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 12:16 pm
by rferebee
I submitted the passive check, they all went green and then once the next check ran they all went Critical again with the same error.
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 12:55 pm
by scottwilkerson
You aren't by chance using mod_gearman are you?
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 12:57 pm
by scottwilkerson
Also, are we looking at the same server in the GUI that we are running the commands from the CLI?
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 1:07 pm
by rferebee
Ok. I have to apologize. I was running all the commands on my test server because I assumed the issue with the database was occurring on all 3 of my XI servers.
The same service check was returning an error for all 3 servers, so I thought I would prove the fix in test before applying it to production.
What I'm confused about. If the table was only broken on the production server, why would that effect the checks for my fail over and test servers?
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 4:34 pm
by scottwilkerson
rferebee wrote:Ok. I have to apologize. I was running all the commands on my test server because I assumed the issue with the database was occurring on all 3 of my XI servers.
The same service check was returning an error for all 3 servers, so I thought I would prove the fix in test before applying it to production.
What I'm confused about. If the table was only broken on the production server, why would that effect the checks for my fail over and test servers?
It's possible it was broken on all 3...
But then the repair would need to be done on all 3
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 4:36 pm
by rferebee
I'm not 100% sure. All I know is that I ran the repair on my production server and it resolved the issue for all three servers. Strange.
Anyway, go ahead and lock this up. Thanks for your help.
Re: Unable to get 'nagios_logentries' DB to recover
Posted: Wed Oct 02, 2019 4:37 pm
by scottwilkerson
rferebee wrote:I'm not 100% sure. All I know is that I ran the repair on my production server and it resolved the issue for all three servers. Strange.
Anyway, go ahead and lock this up. Thanks for your help.
Great!
Locking