Do you know if you're calling mod_gearman_worker with any arguments?
Please run on dce1:
Code: Select all
ps -ef | grep gearmanCode: Select all
ps -ef | grep gearmanCode: Select all
When we started:
Queue Name | Worker Available | Jobs Waiting | Jobs Running
-------------------------------------------------------------------------
check_results | 1 | 0 | 0
eventhandler | 6 | 0 | 0
host | 13 | 0 | 2
hostgroup_gearman_dce1 | 5 | 0 | 0
hostgroup_gearman_dcn1 | 5 | 0 | 0
service | 13 | 0 | 0
worker_gearmandce1 | 1 | 0 | 0
worker_gearmandcn1 | 1 | 0 | 0
worker_nagmonus1 | 1 | 0 | 0
worker_nagmonus2 | 1 | 0 | 0
-------------------------------------------------------------------------
After modifying NEB config
Queue Name | Worker Available | Jobs Waiting | Jobs Running
----------------------------------------------------------------------------
check_results | 2 | 103 | 2
eventhandler | 34 | 0 | 0
host | 54 | 0 | 0
hostgroup_gearman_dce1 | 7 | 0 | 2
hostgroup_gearman_dcn1 | 7 | 0 | 3
service | 54 | 0 | 40
servicegroup_gearman_dce1 | 7 | 0 | 0
servicegroup_gearman_dcn1 | 7 | 0 | 0
worker_gearmandce1 | 1 | 0 | 0
worker_gearmandcn1 | 1 | 0 | 0
worker_nagmonus1 | 1 | 0 | 0
worker_nagmonus2 | 1 | 0 | 0
----------------------------------------------------------------------------
After modifying the worker configs
Queue Name | Worker Available | Jobs Waiting | Jobs Running
----------------------------------------------------------------------------
check_results | 4 | 0 | 1
eventhandler | 51 | 0 | 0
host | 62 | 0 | 1
hostgroup_gearman_dce1 | 7 | 0 | 0
hostgroup_gearman_dcn1 | 5 | 0 | 0
service | 62 | 0 | 39
servicegroup_gearman_dce1 | 7 | 0 | 2
servicegroup_gearman_dcn1 | 5 | 0 | 0
worker_gearmandce1 | 1 | 0 | 0
worker_gearmandcn1 | 1 | 0 | 0
worker_nagmonus1 | 1 | 0 | 0
worker_nagmonus2 | 1 | 0 | 0
----------------------------------------------------------------------------IO wait is potentially caused by the isssue, not causing the issue.bosecorp wrote: Yes, the devices are in a different geographic locations
we placed a worker per location.
could IO wait be the issue?