Nagios Core Alerting Current load
Nagios Core Alerting Current load
Hi I have Been getting alerts stating current load from nagios core. This has been of a lot lately not sure what the issue it is. It is a physical server and has good amount of cpu power and memory. Can some point me in the right direction to resolve this been a pain of late. Thanks
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Nagios Core Alerting Current load
Can you be more specific? what load is it reaching? How many CPU's does the system have, what is the output of wa when running
What are the results of
Code: Select all
top
Code: Select all
ps aux
Re: Nagios Core Alerting Current load
Hi it has 16 cpu's the load average is showing now 1.26,4,39 4,96 when I ran ps aux bunch of results came out would you want me to take screen shot? Thanks.
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Nagios Core Alerting Current load
If you have 16 CPU's that's not a high load.
You likely just need to change the settings for the Current Load check (which by default is for 1 CPU)
Can you share your config for that service on localhost and I can make a recommendation?
Basically you can multiply each of the items in warning and critical by 16
so if you had this for the warning 5.0,4.0,3.0 replace with this 80.0,64.0,48.0
same for critical, if you have 10.0,6.0,4.0 replace with 160.0,128.0,64.0
You likely just need to change the settings for the Current Load check (which by default is for 1 CPU)
Can you share your config for that service on localhost and I can make a recommendation?
Basically you can multiply each of the items in warning and critical by 16
so if you had this for the warning 5.0,4.0,3.0 replace with this 80.0,64.0,48.0
same for critical, if you have 10.0,6.0,4.0 replace with 160.0,128.0,64.0
Re: Nagios Core Alerting Current load
Sure what config would you need me to share and thank you.
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Nagios Core Alerting Current load
The one that contains "Current Load" service for localhostskypete wrote:Sure what config would you need me to share and thank you.
Re: Nagios Core Alerting Current load
Is this the one your looking for check_nt_perf_counter? I am not sure the exact file. I was monitoring the server and now it is showing hope this helps.
warning critical
cpu user (76.4/76.4/76.4)
cpu user (82.9/82.9/82.9)
cpu user (72.6/72.6/72.6)
warning critical
cpu user (76.4/76.4/76.4)
cpu user (82.9/82.9/82.9)
cpu user (72.6/72.6/72.6)
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Nagios Core Alerting Current load
You said you were getting Current load alerts from the nagios server? It is the config for that service I would need, it would have nothing to do with check_nt_perf_counter
and I have no idea what these are, it would completely depend on what you have setup.
Monitoring anything else is just creating noise. Also, Nagios support forums can help you resolve some issues, with Nagios but we cannot fix the servers you are monitoring. That is the job of you Systems and Network Administrators.
and I have no idea what these are, it would completely depend on what you have setup.
I really have to make a suggestion. Only monitor items that you know what to do with if they reach the warning and critical values YOU set.skypete wrote:warning critical
cpu user (76.4/76.4/76.4)
cpu user (82.9/82.9/82.9)
cpu user (72.6/72.6/72.6)
Monitoring anything else is just creating noise. Also, Nagios support forums can help you resolve some issues, with Nagios but we cannot fix the servers you are monitoring. That is the job of you Systems and Network Administrators.
Re: Nagios Core Alerting Current load
Let me explain this better I would get a email and this is what it is showing.
service: current load
address 127.0.0.1
state warning
WARNING-load average: 2.64,7.03,7.63
service: current load
address 127.0.0.1
state warning
WARNING-load average: 2.64,7.03,7.63
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: Nagios Core Alerting Current load
Perfect. So you need to find the configuration file that is for current load with a host of localhostskypete wrote:Let me explain this better I would get a email and this is what it is showing.
service: current load
address 127.0.0.1
state warning
WARNING-load average: 2.64,7.03,7.63
It will look similar to this:
Code: Select all
define service {
host_name localhost
service_description Current Load
use local-service
check_command check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
max_check_attempts 1
check_interval 10
retry_interval 1
notification_period 24x7
notifications_enabled 1
register 1
}