Page 1 of 2

Nagios Core Alerting Current load

Posted: Tue Jun 19, 2018 7:24 pm
by skypete
Hi I have Been getting alerts stating current load from nagios core. This has been of a lot lately not sure what the issue it is. It is a physical server and has good amount of cpu power and memory. Can some point me in the right direction to resolve this been a pain of late. Thanks

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 10:31 am
by scottwilkerson
Can you be more specific? what load is it reaching? How many CPU's does the system have, what is the output of wa when running

Code: Select all

top
What are the results of

Code: Select all

ps aux

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 2:13 pm
by skypete
Hi it has 16 cpu's the load average is showing now 1.26,4,39 4,96 when I ran ps aux bunch of results came out would you want me to take screen shot? Thanks.

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 2:20 pm
by scottwilkerson
If you have 16 CPU's that's not a high load.

You likely just need to change the settings for the Current Load check (which by default is for 1 CPU)

Can you share your config for that service on localhost and I can make a recommendation?

Basically you can multiply each of the items in warning and critical by 16

so if you had this for the warning 5.0,4.0,3.0 replace with this 80.0,64.0,48.0

same for critical, if you have 10.0,6.0,4.0 replace with 160.0,128.0,64.0

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 2:54 pm
by skypete
Sure what config would you need me to share and thank you.

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 2:55 pm
by scottwilkerson
skypete wrote:Sure what config would you need me to share and thank you.
The one that contains "Current Load" service for localhost

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 3:11 pm
by skypete
Is this the one your looking for check_nt_perf_counter? I am not sure the exact file. I was monitoring the server and now it is showing hope this helps.

warning critical

cpu user (76.4/76.4/76.4)
cpu user (82.9/82.9/82.9)
cpu user (72.6/72.6/72.6)

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 3:17 pm
by scottwilkerson
You said you were getting Current load alerts from the nagios server? It is the config for that service I would need, it would have nothing to do with check_nt_perf_counter

and I have no idea what these are, it would completely depend on what you have setup.
skypete wrote:warning critical

cpu user (76.4/76.4/76.4)
cpu user (82.9/82.9/82.9)
cpu user (72.6/72.6/72.6)
I really have to make a suggestion. Only monitor items that you know what to do with if they reach the warning and critical values YOU set.

Monitoring anything else is just creating noise. Also, Nagios support forums can help you resolve some issues, with Nagios but we cannot fix the servers you are monitoring. That is the job of you Systems and Network Administrators.

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 3:26 pm
by skypete
Let me explain this better I would get a email and this is what it is showing.

service: current load
address 127.0.0.1
state warning
WARNING-load average: 2.64,7.03,7.63

Re: Nagios Core Alerting Current load

Posted: Wed Jun 20, 2018 3:55 pm
by scottwilkerson
skypete wrote:Let me explain this better I would get a email and this is what it is showing.

service: current load
address 127.0.0.1
state warning
WARNING-load average: 2.64,7.03,7.63
Perfect. So you need to find the configuration file that is for current load with a host of localhost

It will look similar to this:

Code: Select all

define service {
	host_name			localhost
	service_description		Current Load
	use				local-service
	check_command			check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
	max_check_attempts		1
	check_interval			10
	retry_interval			1
	notification_period		24x7
	notifications_enabled		1
	register			1
	}