Setup:
--------
We run Nagios 3 Core with the check_wmi_plus plugin.
The plugin uses Zenoss' wmic, which I've compiled from source.
Nagios runs under Debian Squeeze in a VM on a VMware ESXi hypervisor. The Linux VM has plenty of RAM and 4 cores at 2.3 GHz each.
The Nagios installation checks about 300 hosts with ca. 3000 checks in total.
Question:
------------
Is it possible to reduce the load or use the CPUs more efficiently?
top constantly shows a load average of around 5.97, 5.83, 5.62.
What's interesting is that each CPU runs at under 30% user, with idle at about 70%.
I suspect it's not using the full CPU power.
The Linux VM also slows down the whole VMware hypervisor, even though it is still not running at 100% (see below).
Output of top:
top - 14:17:33 up 47 days, 4:11, 1 user, load average: 7.41, 6.16, 5.73
Tasks: 113 total, 2 running, 110 sleeping, 0 stopped, 1 zombie
Cpu0 : 13.8%us, 10.8%sy, 0.0%ni, 74.4%id, 0.7%wa, 0.0%hi, 0.3%si, 0.0%st
Cpu1 : 16.3%us, 12.1%sy, 0.0%ni, 71.0%id, 0.7%wa, 0.0%hi, 0.0%si, 0.0%st
Cpu2 : 13.6%us, 10.4%sy, 0.0%ni, 76.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Cpu3 : 12.1%us, 9.2%sy, 0.0%ni, 78.0%id, 0.3%wa, 0.0%hi, 0.3%si, 0.0%st
Mem: 3895264k total, 3459336k used, 435928k free, 299660k buffers
Swap: 477176k total, 52k used, 477124k free, 2730444k cached
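To put the load average in relation to the core count, here is a minimal sketch that divides each load figure by the number of cores. The function name per_core_load is made up for illustration. Keep in mind that the Linux load average counts runnable tasks as well as tasks in uninterruptible sleep, so it can be high even while the CPUs look mostly idle.

```python
def per_core_load(load_average, num_cores):
    """Return the load average divided by the number of CPU cores.

    A value above 1.0 means that, on average, more tasks were runnable
    (or in uninterruptible sleep) than there are cores.
    """
    if num_cores <= 0:
        raise ValueError("num_cores must be positive")
    return load_average / num_cores


if __name__ == "__main__":
    # 1-, 5- and 15-minute load averages from the top output above, 4 cores.
    for load in (7.41, 6.16, 5.73):
        print(f"load {load:.2f} -> {per_core_load(load, 4):.2f} per core")
```

With the figures above, every value comes out above 1.0 per core, which does not match the roughly 75% idle that top reports for each CPU.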
Nagios check statistics (min / max / avg):
Service Check Execution Time: 0.03 / 31.30 / 2.650 sec
Service Check Latency: 1.74 / 199.01 / 169.317 sec
Host Check Execution Time: 0.02 / 15.26 / 3.921 sec
Host Check Latency: 30.98 / 186.18 / 167.100 sec
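As a rough sanity check on these numbers, the sketch below estimates how many checks have to be in flight at the same time to keep up (rate multiplied by average duration, i.e. Little's law). The 300-second check interval is an assumption and not taken from my config, and required_concurrency is just an illustrative name.

```python
def required_concurrency(num_checks, avg_exec_seconds, interval_seconds):
    """Average number of checks that must run in parallel to keep up.

    Checks arrive at num_checks / interval_seconds per second and each
    one occupies a slot for avg_exec_seconds on average.
    """
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    return num_checks * avg_exec_seconds / interval_seconds


if __name__ == "__main__":
    # ca. 3000 checks and 2.650 s average execution time are from above;
    # the 300 s interval is an ASSUMPTION (not stated in this post).
    needed = required_concurrency(3000, 2.650, 300)
    print(f"about {needed:.1f} checks in flight on average")
```

If the real interval is shorter, the required parallelism goes up proportionally. Average latencies of around 170 seconds suggest the scheduler is not reaching the parallelism it needs.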
HK