CPU Critical 100 alert sent as Warning?
Posted: Mon Sep 13, 2021 10:42 am
Had a strange one this morning:
Using NCPA to monitor Windows servers:
Check interval: 6 min
Retry interval: 1 min
Max retries: 4
cpu/percent -w '97' -c '100' -q 'aggregate=avg'
Alert notification:
Service: CPU Load
State: WARNING
Info: WARNING: Percent was 100.00 %
My guess? The monitor rechecked four times in four minutes, so if the average calculated over those checks was slightly below 100 (say 99.9%), the severity would be Warning, not Critical.
Unless someone has a better explanation, I've suggested changing the Critical threshold to 99%.
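To make the threshold question concrete, here is a minimal Python sketch. It assumes the thresholds are evaluated with the standard Nagios plugin convention, where a bare number N alerts only when the value is below 0 or strictly greater than N. The `classify` function is hypothetical and written for illustration only; it is not taken from check_ncpa.py or the NCPA agent. Under that assumption, a reading of exactly 100 with -c '100' would stay at Warning, and so would a reading just under 100 that is merely displayed as 100.00.

```python
def classify(value, warning, critical):
    """Return a Nagios-style state for simple numeric thresholds.

    Assumption: a bare threshold N means "alert when the value is
    outside the range 0..N", i.e. value < 0 or value > N.
    """
    def outside(v, limit):
        return v < 0 or v > limit

    if outside(value, critical):
        return "CRITICAL"
    if outside(value, warning):
        return "WARNING"
    return "OK"


# Thresholds from the check above: -w '97' -c '100'
print(classify(100.0, 97, 100))   # exactly 100 is not greater than 100
print(classify(99.996, 97, 100))  # just under 100, shown as 100.00
print(f"{99.996:.2f}")            # how such a value would be formatted

# With the proposed Critical threshold of 99
print(classify(100.0, 97, 99))
```

If that convention applies here, lowering the Critical threshold to 99 would make a sustained 100% reading alert as Critical in either scenario.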
check_ncpa.py, Version 1.2.4
Nagios XI 5.8.5
Thanks!
Craig