Some Items not Graphing

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
EchoUser
Posts: 14
Joined: Wed Sep 22, 2021 9:24 am

Some Items not Graphing

Post by EchoUser »

Hello Support,

We have several services that have stopped graphing the performance data. The performance data is still coming in, and other services are not having any issues.

In both the Nagios XI and the core interface it shows the performance data is coming in.

Can you advise on how to get the graphing to start again?
You do not have the required permissions to view the files attached to this post.
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Some Items not Graphing

Post by pbroste »

Hello @EchoUser

Thanks for reaching out, want to take a look at the System Profile so we can see what is going on.

To send us your system profile.
  • Login to the Nagios XI GUI using a web browser.
  • Click the "Admin" > "System Profile" Menu
  • Click the "Download Profile" button
  • Save the profile.zip file and send via Private Message
Thanks,
Perry
EchoUser
Posts: 14
Joined: Wed Sep 22, 2021 9:24 am

Re: Some Items not Graphing

Post by EchoUser »

PM sent.
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Some Items not Graphing

Post by pbroste »

Hello @EchoUser

Thanks for sending over the System Profile, we see events logged in the 'npcd.logs' , because you're getting things like:

NPCD: WARN: MAX load reached: load 12.800000

Let's start with increasing the following values in these files:

/usr/local/nagios/etc/pnp/process_perfdata.cfg:
TIMEOUT = 15

/usr/local/nagios/etc/pnp/npcd.cfg:
sleep_time = 10

Try a timeout of greater than 30 and a sleep time of (greater than) 15 and see if that improves things.

Then do:
systemctl restart npcd

Thanks,
Perry
EchoUser
Posts: 14
Joined: Wed Sep 22, 2021 9:24 am

Re: Some Items not Graphing

Post by EchoUser »

I have made the changes you suggested. I will review again tomorrow to see if items are graphing again.

Thanks.
EchoUser
Posts: 14
Joined: Wed Sep 22, 2021 9:24 am

Re: Some Items not Graphing

Post by EchoUser »

The change doesn't seem to have made the graphing start working again.
You do not have the required permissions to view the files attached to this post.
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Some Items not Graphing

Post by pbroste »

Hello @EchoUser

Thanks for following up, let's start by looking at the perfdata for 'DESKTOP-BELIKOV'; interested in current timestamps, and populated xml/rdd host and services perfdata.

Code: Select all

ls -lahrt /usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/
If you are not concerned with historical perfdata let's go ahead and clear the .xml and .rdd:

Code: Select all

cd /usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/
rm -rf <service_name.rrd>
rm -rf <service_name.xml>
Restart ncpd service:

Code: Select all

systemctl restart npcd.service
make sure it started:

Code: Select all

systemctl status npcd.service
Increase the perfdata logging:

Code: Select all

/usr/local/nagios/etc/pnp/npcd.cfg
Change:
log_level = 0
To:
log_level = -1

Nagios XI web console > Service Status, click on a "problem" service, go to the Advanced tab and show us a screenshot of the page. see example:
  • yourhostserviceadvanced.png
Then post the following files on the forum:

- /usr/local/nagios/etc/pnp/npcd.cfg
- /usr/local/nagios/etc/pnp/process_perfdata.cfg
- /usr/local/nagios/var/npcd.log
- /usr/local/nagios/var/perfdata.log

Onliner:

Code: Select all

cat /usr/local/nagios/etc/pnp/npcd.cfg > /tmp/results.txt && cat /usr/local/nagios/etc/pnp/process_perfdata.cfg >> /tmp/results.txt && cat /usr/local/nagios/var/npcd.log >> /tmp/results.txt && cat /usr/local/nagios/var/perfdata.log >> /tmp/results.txt && uptime >> /tmp/results.txt
Please send over the /tmp/results.txt

Thanks,
Perry
You do not have the required permissions to view the files attached to this post.
EchoUser
Posts: 14
Joined: Wed Sep 22, 2021 9:24 am

Re: Some Items not Graphing

Post by EchoUser »

ls -lahrt /usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/
-rw-rw-r--. 1 nagios nagios 1.5M Jan 11 23:17 Memory_Usage.rrd
drwxrwxr-x. 111 nagios nagios 4.0K Jan 28 09:58 ..
-rw-rw-r-- 1 nagios nagios 2.2K Feb 1 15:47 CPU_Usage.xml
-rw-rw-r--. 1 nagios nagios 376K Feb 1 15:47 CPU_Usage.rrd
-rw-rw-r-- 1 nagios nagios 2.4K Feb 1 15:48 Paging_File_Usage.xml
-rw-rw-r--. 1 nagios nagios 376K Feb 1 15:48 Paging_File_Usage.rrd
-rw-rw-r--. 1 nagios nagios 1.1M Feb 1 15:48 Disk_Usage_on_C__.rrd
-rw-rw-r-- 1 nagios nagios 3.6K Feb 1 15:48 Disk_Usage_on_C__.xml
-rw-rw-r-- 1 nagios nagios 2.3K Feb 1 15:48 Check_Uptime.xml
-rw-rw-r--. 1 nagios nagios 376K Feb 1 15:48 Check_Uptime.rrd
-rw-rw-r-- 1 nagios nagios 2.3K Feb 1 15:48 Number_of_System_Threads.xml
-rw-rw-r--. 1 nagios nagios 376K Feb 1 15:48 Number_of_System_Threads.rrd
-rw-rw-r--. 1 nagios nagios 751K Feb 1 15:49 _HOST_.rrd
-rw-rw-r-- 1 nagios nagios 2.7K Feb 1 15:49 _HOST_.xml
-rw-rw-r-- 1 nagios nagios 5.0K Feb 1 15:49 Memory_Usage.xml
-rw-rw-r-- 1 nagios nagios 3.6K Feb 1 15:50 Swap_Usage.xml
-rw-rw-r--. 1 nagios nagios 1.1M Feb 1 15:50 Swap_Usage.rrd
drwxrwxr-x. 2 nagios nagios 4.0K Feb 1 15:50 .

It looks like it hasn't updated the .rrd file since the date that the graph stopped. We were prefer to keep the history if possible, as there may be other services that are having the same issue.

Restarting service:

Code: Select all

# systemctl restart npcd.service
[root@devqa-nagios ~]# systemctl status npcd.service
â— npcd.service - LSB: Nagios NPCD Initscript
   Loaded: loaded (/etc/rc.d/init.d/npcd; bad; vendor preset: disabled)
   Active: active (running) since Tue 2022-02-01 15:58:30 EST; 2min 3s ago
     Docs: man:systemd-sysv-generator(8)
  Process: 32046 ExecStop=/etc/rc.d/init.d/npcd stop (code=exited, status=0/SUCCESS)
  Process: 32052 ExecStart=/etc/rc.d/init.d/npcd start (code=exited, status=0/SUCCESS)
 Main PID: 32055 (npcd)
   CGroup: /system.slice/npcd.service
           └─32055 /usr/local/nagios/bin/npcd -d -f /usr/local/nagios/etc/pnp/npcd.cfg

Feb 01 15:58:30 devqa-nagios systemd[1]: Stopped LSB: Nagios NPCD Initscript.
Feb 01 15:58:30 devqa-nagios systemd[1]: Starting LSB: Nagios NPCD Initscript...
Feb 01 15:58:30 devqa-nagios npcd[32052]: NPCD started.
Feb 01 15:58:30 devqa-nagios systemd[1]: Failed to parse PID from file /usr/local/nagiosxi/var/subsys/npcd.pid: Invalid argument
Feb 01 15:58:30 devqa-nagios systemd[1]: Started LSB: Nagios NPCD Initscript.
See attached screenshot.

Results sent via PM.
You do not have the required permissions to view the files attached to this post.
User avatar
pbroste
Posts: 1288
Joined: Tue Jun 01, 2021 1:27 pm

Re: Some Items not Graphing

Post by pbroste »

Hello @EchoUser

Thanks for sending over the results, want to have you move the data in your '/usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/' to a temp directory and watch it repopulate:

Code: Select all

mkdir -p /tmp/usr/local/nagios/share/perfdata/DESKTOP-BELIKOV && mv /usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/* /tmp/usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/
Then watch the directory:

Code: Select all

watch -n 3 ls /usr/local/nagios/share/perfdata/DESKTOP-BELIKOV/*
Wait for 30 minutes or so, then check perfdata graph.

The results that were sent over are missing logging and want to have you increase the logging.

https://support.nagios.com/kb/article.php?id=9


Thanks,
Perry
EchoUser
Posts: 14
Joined: Wed Sep 22, 2021 9:24 am

Re: Some Items not Graphing

Post by EchoUser »

After moving the content, the graph has started working, but the historical data is now no longer present (from before it stopped working). Is there a way to re-import that data back?
Locked