Nagios 4 Load issues
Posted: Fri Oct 21, 2016 3:33 am
Moderator Edit: This thread has been split from another - https://support.nagios.com/forum/viewtopic.php?t=27068
In the future, please create a new thread and link to the old one instead of adding on.
Sorry to bump an old thread, but I'm experiencing very similar behaviour on some Nagios hosts I've recently built. Initially I built a RHEL7 host using Nagios 4.0.8 that comes with the distro, but I've since built an identical instance using 4.2.1 from source in hope that it may have been due to a bug that has been fixed, but it's also showing the same behaviour. In fact all hosts I've built across test, stage and our production environments are showing the same regular spikey load patterns. The hosts seem to perform fine even when exhibiting these high load spikes.
The auto rescheduler is already disabled:
$ grep auto nagios.cfg
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
I've attached a sample of load graphs from my hosts. The busiest of these servers has around 140 hosts with just shy of 4000 service checks.
I note the following support article which refers to nom checkpoints, however this seems to be a Nagiox XI thing (but it does refer to Nagios 4 Core) https://support.nagios.com/kb/article.php?id=150
In the future, please create a new thread and link to the old one instead of adding on.
Sorry to bump an old thread, but I'm experiencing very similar behaviour on some Nagios hosts I've recently built. Initially I built a RHEL7 host using Nagios 4.0.8 that comes with the distro, but I've since built an identical instance using 4.2.1 from source in hope that it may have been due to a bug that has been fixed, but it's also showing the same behaviour. In fact all hosts I've built across test, stage and our production environments are showing the same regular spikey load patterns. The hosts seem to perform fine even when exhibiting these high load spikes.
The auto rescheduler is already disabled:
$ grep auto nagios.cfg
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
I've attached a sample of load graphs from my hosts. The busiest of these servers has around 140 hosts with just shy of 4000 service checks.
I note the following support article which refers to nom checkpoints, however this seems to be a Nagiox XI thing (but it does refer to Nagios 4 Core) https://support.nagios.com/kb/article.php?id=150