Performance Grapher Issues

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
Sarg0n
Posts: 39
Joined: Mon Jun 12, 2017 9:58 am

Performance Grapher Issues

Post by Sarg0n »

Hi Team,

I realize there is a plethora of forum posts about the performance grapher in Nagios XI not working, but I can't seem to find my specific problem. If I "systemctl status npcd" it shows as active, but the GUI front end shows the performance grapher shows red and if I scroll over it, it says "NPCD not running". Other system components that are down include: Database Maintenance, Command Subsystem, Event Manager, Feed Processor, Report Engine, Cleaner, Nonstop Operations Manager and System Statistics. The only two components showing green are Monitoring Engine and Database backend. What would cause all of those to go down, as in what do they have in common that would cause them all to fail?

Let me add that we ran out of space on the root directory last week, so I had to increase the size of the VM we have XI on and increase the partition. The server is back up and running, but something is keeping those components from working. Not sure if this is important info for diagnosis as I realize space issues may harm the database maybe?

Let me warn you that I can only type any results that you wish me to grab, as the systems are on a closed network. Thanks for your help.
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: Performance Grapher Issues

Post by npolovenko »

Hello, @Sarg0n. NDO2DB probably crashed when the system ran out of space, but it left the lock file. So Nagios thinks that its running but in reality, it's not. Please run these commands in order:

Code: Select all

service nagios stop
service ndo2db stop
mv /usr/local/nagios/var/ndo2db.lock /usr/local/nagios/var/ndo2db.lock.bak
mv /usr/local/nagios/var/ndo.sock /usr/local/nagios/var/ndo.sock.bak
service nagios start
service ndo2db start
That should fix your problem.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Sarg0n
Posts: 39
Joined: Mon Jun 12, 2017 9:58 am

Re: Performance Grapher Issues

Post by Sarg0n »

Went to follow the directions but there is no ndo2db.lock or ndo.sock on the system.
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: Performance Grapher Issues

Post by npolovenko »

Does this work?

Code: Select all

service ndo2db start
service ndo2db status
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Sarg0n
Posts: 39
Joined: Mon Jun 12, 2017 9:58 am

Re: Performance Grapher Issues

Post by Sarg0n »

Yes, and it says "OK" when restarted. But, still same results.
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: Performance Grapher Issues

Post by npolovenko »

Can you navigate to the System Component Status page in XI, its under the Admin tab, and then next to the Database backend click on Action -> restart. Does that fix the issue?
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Sarg0n
Posts: 39
Joined: Mon Jun 12, 2017 9:58 am

Re: Performance Grapher Issues

Post by Sarg0n »

No, that's where I began. When you scroll over the system component it says "NPCD not running", but if I open a terminal and enter "systemctl status npcd", it shows active and running. That's what's confusing...so the lock issue you gave would make sense, but i don't see a .lock file for ndo2db. If you could point me to the correct log, maybe I could tail -f the results and see what's happening in the background while I try to start the performance grapher system component.
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: Performance Grapher Issues

Post by npolovenko »

@Sarg0n, Sure, to start you can take a look at /usr/local/nagios/var/npcd.log and /var/log/messages
ALso, this command may help. I want to see if the lock and sock files were finally recreated.

Code: Select all

ls /usr/local/nagios/var
Also, try this procedure:

Code: Select all

service npcd stop
killall npcd
ps -aef | grep npcd
service npcd start
Then:

Code: Select all

service nagios restart
service httpd restart
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
Locked