Page 1 of 1

Nagios performance engine keeps crashing

Posted: Tue Aug 01, 2017 3:39 pm
by vuduops
Nagios performance engine keeps crashing.

Code: Select all

[08-01-2017 20:08:58] NPCD: npcd Daemon (0.4.14) started with PID=30016
[08-01-2017 20:08:58] NPCD: Please have a look at 'npcd -V' to get license information
[08-01-2017 20:08:58] NPCD: HINT: load_threshold is enabled - ('10.000000')
[08-01-2017 20:09:54] NPCD: WARN: MAX load reached: load 11.280000/10.000000 at i=337
[08-01-2017 20:11:43] NPCD: WARN: MAX load reached: load 10.380000/10.000000 at i=927
[08-01-2017 20:13:43] NPCD: Could not create thread... exiting with error 'Resource temporarily unavailable'
[08-01-2017 20:35:15] NPCD: npcd Daemon (0.4.14) started with PID=6302
[08-01-2017 20:35:15] NPCD: Please have a look at 'npcd -V' to get license information
[08-01-2017 20:35:15] NPCD: HINT: load_threshold is enabled - ('10.000000')
[08-01-2017 20:35:43] NPCD: Could not create thread... exiting with error 'Resource temporarily unavailable
-Krishna

Re: Nagios performance engine keeps crashing

Posted: Tue Aug 01, 2017 4:21 pm
by cdienger
Is the load always this high? How many cpus are allocated to this server? You can run

Code: Select all

grep processor /proc/cpuinfo  | wc -l 
to get the number.

You can set the load_threshold threshold variable in /root/etc/pnp/npcd.cfg which is 10 by default to work around this.

Re: Nagios performance engine keeps crashing

Posted: Wed Aug 02, 2017 9:31 am
by vuduops
I increased the load threshold to 16 but I still see the issue ?

Code: Select all

[[email protected] ~]#     grep processor /proc/cpuinfo  | wc -l 
16

[[email protected] ~]# cat /usr/local/nagios/etc/pnp/npcd.cfg | grep load
# use_load_threshold - enables/disables load watching
# use_load_threshold = <0 / 1> (default: 0)
#use_load_threshold = 0
# load_threshold - npcd won't start new threads
# if your system load is over this threshold
# load_threshold = <float value> (default: 10.0)
load_threshold = 16.0


Re: Nagios performance engine keeps crashing

Posted: Wed Aug 02, 2017 12:57 pm
by tmcdonald
The correct file location should actually be /usr/local/nagios/etc/pnp/npcd.cfg as you have, and you need to run service npcd restart after for it to take effect.

I agree though, a load of 16 is somewhat high. How many hosts/services total are you checking? Is there any other application running on this server?

Re: Nagios performance engine keeps crashing

Posted: Mon Aug 07, 2017 5:54 pm
by vuduops
I have attached the screen shot for you reference. I have commented out the cpu_threshold to see what happens.

-Krishna

Re: Nagios performance engine keeps crashing

Posted: Tue Aug 08, 2017 1:44 pm
by tmcdonald
Any changes to report?