CPU Checks not reporting accurately

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

CPU Checks not reporting accurately

Post by rferebee »

Hello,

We experienced an issue over the weekend where some of our Exchange servers CPUs were pegged above 90% but we were not alerted by Nagios XI. Upon initial inspection, it appears that Nagios XI is reporting CPU Load as roughly 20-25% lower than what the server is reporting on itself.

Please see attached screen shots.

It looks like the check is on a 5 minute interval which might explain the lack of reporting, but this server had it's CPU running at over 90% for almost 3 days. Even now it's getting up to 80-85% without ever reporting even a warning status.

Is there another check we could be using or maybe a different way to get more accurate reporting?

Thank you.
You do not have the required permissions to view the files attached to this post.
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: CPU Checks not reporting accurately

Post by tgriep »

This is what the current check is doing
CPULOAD =
Average CPU load on last x minutes.
Request a -l parameter with the following syntax:
-l <minutes range>,<warning threshold>,<critical threshold>.
<minute range> should be less than 24*60.
Thresholds are percentage and up to 10 requests can be done in one shot.
ie: -l 60,90,95,120,90,95
It is an Averaged check. The server has to be running over the threshold for the full timeperiod that the check is running to generate an email Notification.
If not, it will not send an email notification.
The default setting is 5 minutes.
Be sure to check out our Knowledgebase for helpful articles and solutions!
rferebee
Posts: 733
Joined: Wed Jul 11, 2018 11:37 am

Re: CPU Checks not reporting accurately

Post by rferebee »

This can be locked, thank you.
Locked