jdalrymple wrote:Now - onto what my coworker Box293 would recommend - switch to his check.
https://exchange.nagios.org/directory/P ... re/details
He works on it a lot and it is configured in such a fashion that the checks are performed on a vMA (VMware Management Assistant) instead of directly on your XI box.
Frédéric GRANAT wrote:You said :
At most 30 hosts and about 1000 VMs. That said your environment is likely much larger so therein lies the additional load
My answer : No I said we have 291 hosts (16 ESX servers) and 725 services.
Before I add the 5 new ESX servers, the CPU consumption was fine.
Offloading the ESX checks from Nagios XI will make a difference. This was the reason why I developed the box293_check_vmware plugin.
However, try the following as I am pretty sure it'll make a difference now.
What I am going to suggest is that we modify the command parameter for these check_esx3.pl checks so the plugin runs at a lower priority. The plugin will still operate normally, but it won't hog the CPU when other processes are requesting it.
Configure > Core Configuration Manager
Commands > Commands
Search for esx3
Click the check_esx3_guest
Change it to:
nice -n19 $USER1$/check_esx3.pl -H "$HOSTADDRESS$" -f "$ARG1$" -N "$ARG2$" -l "$ARG3$"$ARG4$
Click Save
Click the check_esx3_host
Change it to:
nice -n19 $USER1$/check_esx3.pl -H "$HOSTADDRESS$" -f "$ARG1$" -l "$ARG2$"$ARG3$
Click Save
Click Apply Configuration.
All I have done is prepended the command with "nice -n19".
I would observe the system for a while to see if this makes any different to the performance. You can look at the Nagios XI historic performance graphs by looking at the localhost service "Current Load".