Page 1 of 1

Performance Grapher Issues

Posted: Wed Feb 14, 2018 11:21 am
by Sarg0n
Hi Team,

I realize there is a plethora of forum posts about the performance grapher in Nagios XI not working, but I can't seem to find my specific problem. If I "systemctl status npcd" it shows as active, but the GUI front end shows the performance grapher shows red and if I scroll over it, it says "NPCD not running". Other system components that are down include: Database Maintenance, Command Subsystem, Event Manager, Feed Processor, Report Engine, Cleaner, Nonstop Operations Manager and System Statistics. The only two components showing green are Monitoring Engine and Database backend. What would cause all of those to go down, as in what do they have in common that would cause them all to fail?

Let me add that we ran out of space on the root directory last week, so I had to increase the size of the VM we have XI on and increase the partition. The server is back up and running, but something is keeping those components from working. Not sure if this is important info for diagnosis as I realize space issues may harm the database maybe?

Let me warn you that I can only type any results that you wish me to grab, as the systems are on a closed network. Thanks for your help.

Re: Performance Grapher Issues

Posted: Wed Feb 14, 2018 2:32 pm
by npolovenko
Hello, @Sarg0n. NDO2DB probably crashed when the system ran out of space, but it left the lock file. So Nagios thinks that its running but in reality, it's not. Please run these commands in order:

Code: Select all

service nagios stop
service ndo2db stop
mv /usr/local/nagios/var/ndo2db.lock /usr/local/nagios/var/ndo2db.lock.bak
mv /usr/local/nagios/var/ndo.sock /usr/local/nagios/var/ndo.sock.bak
service nagios start
service ndo2db start
That should fix your problem.

Re: Performance Grapher Issues

Posted: Wed Feb 14, 2018 3:39 pm
by Sarg0n
Went to follow the directions but there is no ndo2db.lock or ndo.sock on the system.

Re: Performance Grapher Issues

Posted: Wed Feb 14, 2018 3:41 pm
by npolovenko
Does this work?

Code: Select all

service ndo2db start
service ndo2db status

Re: Performance Grapher Issues

Posted: Wed Feb 14, 2018 4:03 pm
by Sarg0n
Yes, and it says "OK" when restarted. But, still same results.

Re: Performance Grapher Issues

Posted: Wed Feb 14, 2018 5:46 pm
by npolovenko
Can you navigate to the System Component Status page in XI, its under the Admin tab, and then next to the Database backend click on Action -> restart. Does that fix the issue?

Re: Performance Grapher Issues

Posted: Thu Feb 15, 2018 9:15 am
by Sarg0n
No, that's where I began. When you scroll over the system component it says "NPCD not running", but if I open a terminal and enter "systemctl status npcd", it shows active and running. That's what's confusing...so the lock issue you gave would make sense, but i don't see a .lock file for ndo2db. If you could point me to the correct log, maybe I could tail -f the results and see what's happening in the background while I try to start the performance grapher system component.

Re: Performance Grapher Issues

Posted: Thu Feb 15, 2018 2:33 pm
by npolovenko
@Sarg0n, Sure, to start you can take a look at /usr/local/nagios/var/npcd.log and /var/log/messages
ALso, this command may help. I want to see if the lock and sock files were finally recreated.

Code: Select all

ls /usr/local/nagios/var
Also, try this procedure:

Code: Select all

service npcd stop
killall npcd
ps -aef | grep npcd
service npcd start
Then:

Code: Select all

service nagios restart
service httpd restart