In the future, please create a new thread and link to the old one instead of adding on.
We have similar problem using Nagios 4.0.8 and don't have "mod gearman" option.
Tried to increase number of workers and processes, but no noticeable changes were observed. Also same issue with 4.1.1 and 4.1.2Pre.
Similar design with less hosts and checks is implemented on a different node, and this issue is not present.
Any suggestion what else to look at? Of course we could create another node with less hosts and checks, but then it wouldn't be scalable.
Code: Select all
nagios -v /etc/nagios/nagios.cfg
running pre-flight check on configuration data...
Checking objects...
Checked 2404 services.
Checked 376 hosts.
Checked 151 host groups.
Checked 53 service groups.
Checked 74 contacts.
Checked 13 contact groups.
Checked 99 commands.
Checked 16 time periods.
Checked 752 host escalations.
Checked 38714 service escalations.
Checking for circular paths...
Checked 376 hosts
Checked 0 service dependencies
Checked 0 host dependencies
Checked 16 timeperiods
Checking global event handlers...
Checking obsessive compulsive processor commands...
Checking misc settings...
Total Warnings: 0
Total Errors: 0
Things look okay - No serious problems were detected during the pre-flight check
Code: Select all
nagiostats
Nagios Stats 4.0.8
Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
Last Modified: 08-12-2014
License: GPL
CURRENT STATUS DATA
------------------------------------------------------
Status File: /var/log/nagios/status.dat
Status File Age: 0d 0h 0m 1s
Status File Version: 4.0.8
Program Running Time: 0d 12h 42m 31s
Nagios PID: 24115
Total Services: 2404
Services Checked: 2404
Services Scheduled: 2401
Services Actively Checked: 2401
Services Passively Checked: 3
Total Service State Change: 0.000 / 10.260 / 0.012 %
Active Service Latency: 0.000 / 1.821 / 0.002 sec
Active Service Execution Time: 0.003 / 4.014 / 0.802 sec
Active Service State Change: 0.000 / 10.260 / 0.004 %
Active Services Last 1/5/15/60 min: 99 / 601 / 1303 / 2260
Passive Service Latency: 0.331 / 0.719 / 0.586 sec
Passive Service State Change: 6.250 / 6.250 / 6.250 %
Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit: 2383 / 7 / 1 / 13
Services Flapping: 0
Services In Downtime: 0
Total Hosts: 376
Hosts Checked: 376
Hosts Scheduled: 376
Hosts Actively Checked: 376
Host Passively Checked: 0
Total Host State Change: 0.000 / 0.000 / 0.000 %
Active Host Latency: 0.000 / 0.016 / 0.001 sec
Active Host Execution Time: 0.003 / 0.008 / 0.005 sec
Active Host State Change: 0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min: 61 / 356 / 363 / 364
Passive Host Latency: 0.000 / 0.000 / 0.000 sec
Passive Host State Change: 0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0
Hosts Up/Down/Unreach: 376 / 0 / 0
Hosts Flapping: 0
Hosts In Downtime: 0
Active Host Checks Last 1/5/15 min: 106 / 388 / 1098
Scheduled: 106 / 388 / 1098
On-demand: 0 / 0 / 0
Parallel: 106 / 388 / 1098
Serial: 0 / 0 / 0
Cached: 0 / 0 / 0
Passive Host Checks Last 1/5/15 min: 0 / 0 / 0
Active Service Checks Last 1/5/15 min: 142 / 628 / 1747
Scheduled: 142 / 628 / 1747
On-demand: 0 / 0 / 0
Cached: 0 / 0 / 0
Passive Service Checks Last 1/5/15 min: 0 / 0 / 0
External Commands Last 1/5/15 min: 0 / 0 / 0