Mysterious load average alerts on Nagios XI 2011R3.2
Posted: Tue Jul 03, 2012 7:27 pm
We're running Nagios XI 2011R3.2 and are receiving intermittent load average alerts like the following:
Code: Select all
***** Nagios *****
Notification Type: PROBLEM
Service: Current Load
Host: localhost
Address: 127.0.0.1
State: CRITICAL
Date/Time: Tue Jul 3 17:04:18 PDT 2012
Additional Info:
CRITICAL - load average: 25.89, 25.82, 26.40
The mystery is what causes the alert, because when I receive the notification I immediately check the System Status page and it shows things are just fine:
Code: Select all
Load
1-min 0.16
5-min 0.13
15-min 0.09
The service definition "Current Load" uses the command:
Code: Select all
$USER1$/check_load -w $ARG1$ -c $ARG2$
with
$ARG1$ = 5.0,4.0,3.0
$ARG2$ = 10.0,6.0,4.0
I've scanned /usr/local/nagios/var/nagios.log, and don't find any entries with high load average values.
Any ideas on where I should look for the cause of these alerts?
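For reference, here is a rough sketch of how the thresholds above would classify the two sets of readings. It is not the plugin's actual code: the names `parse_thresholds` and `classify_load` are made up for illustration, and it assumes check_load reports a state as soon as any one of the 1-, 5- or 15-minute averages exceeds its matching threshold.

```python
# Illustrative only: hypothetical helpers, not part of Nagios or its plugins.
# Assumption: a state is raised when ANY of the three averages exceeds the
# corresponding threshold, with CRITICAL taking precedence over WARNING.

def parse_thresholds(spec):
    """Parse a '1min,5min,15min' string such as '5.0,4.0,3.0' into floats."""
    values = tuple(float(part) for part in spec.split(","))
    if len(values) != 3:
        raise ValueError("expected three comma-separated values: %r" % spec)
    return values


def classify_load(load, warn, crit):
    """Return 'OK', 'WARNING' or 'CRITICAL' for a (1, 5, 15)-minute triple."""
    state = "OK"
    for value, w, c in zip(load, warn, crit):
        if value > c:
            return "CRITICAL"  # any single average over its limit is enough
        if value > w:
            state = "WARNING"  # keep scanning in case a later one is critical
    return state


if __name__ == "__main__":
    warn = parse_thresholds("5.0,4.0,3.0")   # $ARG1$ from the service definition
    crit = parse_thresholds("10.0,6.0,4.0")  # $ARG2$ from the service definition

    # Values reported in the notification email
    print(classify_load((25.89, 25.82, 26.40), warn, crit))
    # Values shown on the System Status page a moment later
    print(classify_load((0.16, 0.13, 0.09), warn, crit))
```

Under that assumption the alert values are far above every critical threshold, while the System Status values are far below every warning threshold, so the two readings cannot both describe the same moment on the same machine.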