Hi Team,
I realize there is a plethora of forum posts about the performance grapher in Nagios XI not working, but I can't seem to find my specific problem. If I "systemctl status npcd" it shows as active, but the GUI front end shows the performance grapher shows red and if I scroll over it, it says "NPCD not running". Other system components that are down include: Database Maintenance, Command Subsystem, Event Manager, Feed Processor, Report Engine, Cleaner, Nonstop Operations Manager and System Statistics. The only two components showing green are Monitoring Engine and Database backend. What would cause all of those to go down, as in what do they have in common that would cause them all to fail?
Let me add that we ran out of space on the root directory last week, so I had to increase the size of the VM we have XI on and increase the partition. The server is back up and running, but something is keeping those components from working. Not sure if this is important info for diagnosis as I realize space issues may harm the database maybe?
Let me warn you that I can only type any results that you wish me to grab, as the systems are on a closed network. Thanks for your help.
Performance Grapher Issues
-
npolovenko
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance Grapher Issues
Hello, @Sarg0n. NDO2DB probably crashed when the system ran out of space, but it left the lock file. So Nagios thinks that its running but in reality, it's not. Please run these commands in order:
That should fix your problem.
Code: Select all
service nagios stop
service ndo2db stop
mv /usr/local/nagios/var/ndo2db.lock /usr/local/nagios/var/ndo2db.lock.bak
mv /usr/local/nagios/var/ndo.sock /usr/local/nagios/var/ndo.sock.bak
service nagios start
service ndo2db startAs of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Re: Performance Grapher Issues
Went to follow the directions but there is no ndo2db.lock or ndo.sock on the system.
-
npolovenko
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance Grapher Issues
Does this work?
Code: Select all
service ndo2db start
service ndo2db statusAs of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Re: Performance Grapher Issues
Yes, and it says "OK" when restarted. But, still same results.
-
npolovenko
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance Grapher Issues
Can you navigate to the System Component Status page in XI, its under the Admin tab, and then next to the Database backend click on Action -> restart. Does that fix the issue?
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Re: Performance Grapher Issues
No, that's where I began. When you scroll over the system component it says "NPCD not running", but if I open a terminal and enter "systemctl status npcd", it shows active and running. That's what's confusing...so the lock issue you gave would make sense, but i don't see a .lock file for ndo2db. If you could point me to the correct log, maybe I could tail -f the results and see what's happening in the background while I try to start the performance grapher system component.
-
npolovenko
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance Grapher Issues
@Sarg0n, Sure, to start you can take a look at /usr/local/nagios/var/npcd.log and /var/log/messages
ALso, this command may help. I want to see if the lock and sock files were finally recreated.
Also, try this procedure:
Then:
ALso, this command may help. I want to see if the lock and sock files were finally recreated.
Code: Select all
ls /usr/local/nagios/varCode: Select all
service npcd stop
killall npcd
ps -aef | grep npcd
service npcd start
Code: Select all
service nagios restart
service httpd restartAs of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.