Page 1 of 1

CPU health Check Time

Posted: Tue Nov 14, 2017 3:48 am
by bashar.abed
Hi,
we have nagios Xi version 5.4.10, VM machine, and we would like to know what is the minimum health check time period can be configure on the Nagios System ,since we have faced case that we have a server cpu utilization became 100 % for a second and the nagios can not detect it

[[email protected] ~]$ /usr/local/nagios/libexec/check_snmp_load.pl -H 10.x.x.x -C jawwal-snmp --v2c -w 90 -c 95 -f
16 CPU, average load 17.1% < 90% : OK | cpu_prct_used=17.0625%;90;95

Thanks

Re: CPU health Check Time

Posted: Tue Nov 14, 2017 10:40 am
by npolovenko
Hello, @bashar.abed.
You'd define a check interval in Service management/Check Settings. The shortest interval can be set to one minute. And i think the plugin that you're using calculates the average CPU utilization per minute. So if the CPU was at 10% for 50 sec and and at a 100% for 10sec, the check output would probably be around 25%.
If you only want to get the highest values per interval I'd probably look for a different plugin.

Re: CPU health Check Time

Posted: Thu Nov 16, 2017 1:23 am
by bashar.abed
Hi,

can you send me this plugin ,

Thanks

Re: CPU health Check Time

Posted: Thu Nov 16, 2017 11:01 am
by npolovenko
Hi, @bashar.abed. The lowest check interval for all Nagios plugins is once per minute. Doing checks more frequently may by itself overload the system. So depending on your system type we need to figure out if there is such a command that we can run to check the utilization more frequently. How did you find out about the spikes? And if your average load per minute is at 17%, and the CPU went to 100% for only 1sec it doesn't seem like too big of a deal. Do you know what may have caused the spike?
What system are you running on the VM?