Page 1 of 1
All graphs are showing no performance data in Nagios XI
Posted: Wed Sep 16, 2020 9:42 am
by Jagannadharao
Dear Team,
We are observing no data in performance graphs for all services and hosts.
Also observing that no graph icon showing newly adding servers into monitoring.
Please refer below attached screenshot for more details.
Can you help to advise quickly as no performance stats available for services and hosts in monitoring?
Re: All graphs are showing no performance data in Nagios XI
Posted: Thu Sep 17, 2020 1:53 pm
by benjaminsmith
Hi,
Welcome to the Customer Support Forum!
This could happen if the npcd service is not running, so let's check that first.
If that's running, then please step through the guide below to check the spool count on the files. It might be necessary to increase the default thresholds, explained in the guide as well and let me know if the issue is resolved.
Nagios XI - Performance Graph Problems
Regards,
Benjamin
Re: All graphs are showing no performance data in Nagios XI
Posted: Fri Sep 18, 2020 3:53 am
by Jagannadharao
Hi Benjamin,
Here is the output for the command "systemctl status npcd"
$ systemctl status npcd
● npcd.service - LSB: Nagios NPCD Initscript
Loaded: loaded (/etc/rc.d/init.d/npcd; bad; vendor preset: disabled)
Active: active (running) since Fri 2020-09-11 01:02:01 +08; 1 weeks 0 days ago
Docs: man:systemd-sysv-generator(8)
Process: 22165 ExecStop=/etc/rc.d/init.d/npcd stop (code=exited, status=0/SUCCESS)
Process: 45110 ExecStart=/etc/rc.d/init.d/npcd start (code=exited, status=0/SUCCESS)
Main PID: 22184 (npcd)
CGroup: /system.slice/npcd.service
‣ 22184 /usr/local/nagios/bin/npcd -d -f /usr/local/nagios/etc/pnp/npcd.cfg
$
Re: All graphs are showing no performance data in Nagios XI
Posted: Fri Sep 18, 2020 6:08 am
by Jagannadharao
npcd.cfg: Threshold value is changed from 10 to 28 as it was reaching threshold breach with 10.
load_threshold = 10.0 to load_threshold = 28.0
process_perfdata.cfg: TIMEOUT value is changed from 5 to 40.
TIMEOUT = 5 to TIMEOUT = 40
Observing following errors after changing these values:
[root@nagprdappls004 pnp]# grep "MAX load reached" /usr/local/nagios/var/npcd.log | head
[09-09-2020 02:22:51] NPCD: WARN: MAX load reached: load 24.490000/10.000000 at i=1
[09-09-2020 02:23:06] NPCD: WARN: MAX load reached: load 25.620000/10.000000 at i=1
[09-09-2020 02:23:21] NPCD: WARN: MAX load reached: load 24.360000/10.000000 at i=1
[09-09-2020 02:23:36] NPCD: WARN: MAX load reached: load 23.690000/10.000000 at i=1
[09-09-2020 02:23:51] NPCD: WARN: MAX load reached: load 22.210000/10.000000 at i=1
[09-09-2020 02:24:06] NPCD: WARN: MAX load reached: load 21.840000/10.000000 at i=1
[09-09-2020 02:24:21] NPCD: WARN: MAX load reached: load 23.180000/10.000000 at i=1
[09-09-2020 02:24:36] NPCD: WARN: MAX load reached: load 23.370000/10.000000 at i=1
[09-09-2020 02:24:51] NPCD: WARN: MAX load reached: load 28.460000/10.000000 at i=1
[09-09-2020 02:25:06] NPCD: WARN: MAX load reached: load 29.530000/10.000000 at i=1
[root@nagprdappls004 pnp]# grep "MAX load reached" /usr/local/nagios/var/npcd.log | tail
[09-18-2020 18:38:23] NPCD: WARN: MAX load reached: load 23.340000/10.000000 at i=1
[09-18-2020 18:38:38] NPCD: WARN: MAX load reached: load 23.790000/10.000000 at i=1
[09-18-2020 18:38:53] NPCD: WARN: MAX load reached: load 23.770000/10.000000 at i=1
[09-18-2020 18:39:08] NPCD: WARN: MAX load reached: load 23.340000/10.000000 at i=1
[09-18-2020 18:39:23] NPCD: WARN: MAX load reached: load 23.510000/10.000000 at i=1
[09-18-2020 18:39:38] NPCD: WARN: MAX load reached: load 24.070000/10.000000 at i=1
[09-18-2020 18:39:53] NPCD: WARN: MAX load reached: load 23.300000/10.000000 at i=1
[09-18-2020 18:40:08] NPCD: WARN: MAX load reached: load 23.980000/10.000000 at i=1
[09-18-2020 18:40:23] NPCD: WARN: MAX load reached: load 23.240000/10.000000 at i=1
[09-18-2020 18:48:47] NPCD: WARN: MAX load reached: load 28.330000/28.000000 at i=7
[root@nagprdappls004 pnp]#
[root@nagprdappls004 pnp]# tail -f /usr/local/nagios/var/npcd.log
[09-18-2020 18:53:34] NPCD: ERROR: Executed command exits with return code '7'
[09-18-2020 18:53:34] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1600426256.perfdata.service'
[09-18-2020 18:53:34] NPCD: ERROR: Executed command exits with return code '7'
[09-18-2020 18:53:34] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1600426241.perfdata.service'
[09-18-2020 18:54:14] NPCD: ERROR: Executed command exits with return code '7'
[09-18-2020 18:54:14] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1600426271.perfdata.service'
[09-18-2020 18:54:54] NPCD: ERROR: Executed command exits with return code '7'
[09-18-2020 18:54:54] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1600426331.perfdata.service'
[09-18-2020 18:57:28] NPCD: ERROR: Executed command exits with return code '7'
[09-18-2020 18:57:28] NPCD: ERROR: Command line was '/usr/local/nagios/libexec/process_perfdata.pl -n -b /usr/local/nagios/var/spool/perfdata//1600426451.perfdata.service'
[root@nagprdappls004 pnp]# tail -25 /usr/local/nagios/var/perfdata.log
2020-09-18 18:53:34 [127341] [0] *** process_perfdata.pl terminated on signal ALRM
2020-09-18 18:53:34 [127343] [0] *** TIMEOUT: Timeout after 40 secs. ***
2020-09-18 18:53:34 [127343] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2020-09-18 18:53:34 [127343] [0] *** TIMEOUT: Please check your npcd.cfg
2020-09-18 18:53:34 [127343] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1600426241.perfdata.service-PID-127343 deleted
2020-09-18 18:53:34 [127343] [0] *** Timeout while processing Host: "skye-nxge0" Service: "_oftpr_u02"
2020-09-18 18:53:34 [127343] [0] *** process_perfdata.pl terminated on signal ALRM
2020-09-18 18:54:14 [776] [0] *** TIMEOUT: Timeout after 40 secs. ***
2020-09-18 18:54:14 [776] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2020-09-18 18:54:14 [776] [0] *** TIMEOUT: Please check your npcd.cfg
2020-09-18 18:54:14 [776] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1600426271.perfdata.service-PID-776 deleted
2020-09-18 18:54:14 [776] [0] *** Timeout while processing Host: "cespetetlls002" Service: "Free_Swap_Space"
2020-09-18 18:54:14 [776] [0] *** process_perfdata.pl terminated on signal ALRM
2020-09-18 18:54:54 [4961] [0] *** TIMEOUT: Timeout after 40 secs. ***
2020-09-18 18:54:54 [4961] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2020-09-18 18:54:54 [4961] [0] *** TIMEOUT: Please check your npcd.cfg
2020-09-18 18:54:54 [4961] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1600426331.perfdata.service-PID-4961 deleted
2020-09-18 18:54:54 [4961] [0] *** Timeout while processing Host: "fmssitodba001" Service: "_u14"
2020-09-18 18:54:54 [4961] [0] *** process_perfdata.pl terminated on signal ALRM
2020-09-18 18:57:28 [20265] [0] *** TIMEOUT: Timeout after 40 secs. ***
2020-09-18 18:57:28 [20265] [0] *** TIMEOUT: Deleting current file to avoid NPCD loops
2020-09-18 18:57:28 [20265] [0] *** TIMEOUT: Please check your npcd.cfg
2020-09-18 18:57:28 [20265] [0] *** TIMEOUT: /usr/local/nagios/var/spool/perfdata//1600426451.perfdata.service-PID-20265 deleted
2020-09-18 18:57:28 [20265] [0] *** Timeout while processing Host: "ssopptoamw004" Service: "Windows_Memory_Usage"
2020-09-18 18:57:28 [20265] [0] *** process_perfdata.pl terminated on signal ALRM
[root@nagprdappls004 pnp]#
Hope these details helpful to advise further.
Thank you.
Re: All graphs are showing no performance data in Nagios XI
Posted: Fri Sep 18, 2020 4:59 pm
by benjaminsmith
Hi,
Those errors may be logged if the file was processed by another thread. Are the performance graphs working for any of the services?
If not, let's increase the log level on NPCD. You can enable debugging for npcd by setting:
in /usr/local/nagios/etc/pnp/process_perfdata.cfg and then restart the npcd process.
Then tail the log file and post any errors to the thread.
Code: Select all
tail -f /usr/local/nagios/var/perfdata.log
Also, how's the count of spooled files looking?
Code: Select all
ls /usr/local/nagios/var/spool/perfdata/ | wc -l
ls /usr/local/nagios/var/spool/xidpe/ | wc -l
Regards,
Benjamin
Re: All graphs are showing no performance data in Nagios XI
Posted: Sat Sep 19, 2020 6:57 am
by Jagannadharao
Hi Benjimin,
All Graphs are working fine after above mentioned settings put in place.
Thank you.
Best Regards,
Jagan
Re: All graphs are showing no performance data in Nagios XI
Posted: Mon Sep 21, 2020 6:31 am
by scottwilkerson
Jagannadharao wrote:Hi Benjimin,
All Graphs are working fine after above mentioned settings put in place.
Thank you.
Best Regards,
Jagan
Great!
Locking thread