Page 1 of 1

CPU Checks not reporting accurately

Posted: Wed Apr 03, 2019 1:00 pm
by rferebee
Hello,

We experienced an issue over the weekend where some of our Exchange servers CPUs were pegged above 90% but we were not alerted by Nagios XI. Upon initial inspection, it appears that Nagios XI is reporting CPU Load as roughly 20-25% lower than what the server is reporting on itself.

Please see attached screen shots.

It looks like the check is on a 5 minute interval which might explain the lack of reporting, but this server had it's CPU running at over 90% for almost 3 days. Even now it's getting up to 80-85% without ever reporting even a warning status.

Is there another check we could be using or maybe a different way to get more accurate reporting?

Thank you.

Re: CPU Checks not reporting accurately

Posted: Wed Apr 03, 2019 3:12 pm
by tgriep
This is what the current check is doing
CPULOAD =
Average CPU load on last x minutes.
Request a -l parameter with the following syntax:
-l <minutes range>,<warning threshold>,<critical threshold>.
<minute range> should be less than 24*60.
Thresholds are percentage and up to 10 requests can be done in one shot.
ie: -l 60,90,95,120,90,95
It is an Averaged check. The server has to be running over the threshold for the full timeperiod that the check is running to generate an email Notification.
If not, it will not send an email notification.
The default setting is 5 minutes.

Re: CPU Checks not reporting accurately

Posted: Mon Apr 08, 2019 1:35 pm
by rferebee
This can be locked, thank you.