CPU health Check Time

bashar.abed
Posts: 153
Joined: Tue Oct 11, 2016 1:34 am
Location: Ramallah West Bank Palestine

CPU health Check Time

Post by bashar.abed »

Hi,
We are running Nagios XI version 5.4.10 on a VM, and we would like to know the minimum check interval that can be configured in Nagios. We had a case where a server's CPU utilization reached 100% for a second and Nagios did not detect it.

[[email protected] ~]$ /usr/local/nagios/libexec/check_snmp_load.pl -H 10.x.x.x -C jawwal-snmp --v2c -w 90 -c 95 -f
16 CPU, average load 17.1% < 90% : OK | cpu_prct_used=17.0625%;90;95

Thanks
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: CPU health Check Time

Post by npolovenko »

Hello, @bashar.abed.
You can define the check interval in Service management/Check Settings. The shortest interval that can be set is one minute. I think the plugin you're using calculates the average CPU utilization per minute, so if the CPU was at 10% for 50 seconds and at 100% for 10 seconds, the check output would probably be around 25%.
If you only want to get the highest values per interval I'd probably look for a different plugin.
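To make the arithmetic above concrete, here is a minimal Python sketch of a time-weighted average next to the peak value for the same minute. The function names and the `segments` layout are hypothetical and only for illustration; this is not how check_snmp_load.pl computes its value internally. The 90/95 thresholds are the `-w` and `-c` values from the command in the first post.

```python
def time_weighted_average(segments):
    """Average utilization over a window.

    segments: list of (utilization_percent, duration_seconds) pairs.
    (Hypothetical helper for illustration only.)
    """
    total_time = sum(duration for _, duration in segments)
    if total_time <= 0:
        raise ValueError("total duration must be positive")
    return sum(util * duration for util, duration in segments) / total_time


def peak(segments):
    """Highest utilization seen in the window."""
    return max(util for util, _ in segments)


# The case described above: 10% for 50 s, then 100% for 10 s.
segments = [(10.0, 50.0), (100.0, 10.0)]
average = time_weighted_average(segments)
highest = peak(segments)

# Thresholds taken from the -w 90 -c 95 options in the original command.
WARN, CRIT = 90.0, 95.0

print(f"average={average:.1f}% peak={highest:.1f}%")
print("average reaches warning threshold:", average >= WARN)
print("peak reaches critical threshold:", highest >= CRIT)
```

The average comes out at 25%, well under the warning threshold, while the peak is 100%. That is why a check based on a per-minute average would likely stay OK during a short spike.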
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
bashar.abed
Posts: 153
Joined: Tue Oct 11, 2016 1:34 am
Location: Ramallah West Bank Palestine

Re: CPU health Check Time

Post by bashar.abed »

Hi,

Can you send me this plugin?

Thanks
npolovenko
Support Tech
Posts: 3457
Joined: Mon May 15, 2017 5:00 pm

Re: CPU health Check Time

Post by npolovenko »

Hi, @bashar.abed. The lowest check interval for all Nagios plugins is once per minute. Running checks more frequently may itself overload the system. So, depending on your system type, we need to figure out whether there is a command we can run to check the utilization more frequently. How did you find out about the spikes? If your average load per minute is at 17% and the CPU went to 100% for only 1 second, it doesn't seem like too big of a deal. Do you know what may have caused the spike?
What system are you running on the VM?