gearman - a lot JOBs waiting

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Post by rkennedy »

Can you please PM your profile to us so we can take a deeper look?
Former Nagios Employee
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Post by bosecorp »

Before we head in that direction: I don't see this problem on my other gearman servers, and I don't see it on my master gearman server.
bheden
Product Development Manager
Posts: 179
Joined: Thu Feb 13, 2014 9:50 am
Location: Nagios Enterprises

Post by bheden »

Can you please post the contents of /etc/mod_gearman/mod_gearman_neb.conf and /var/log/gearmand/gearmand.log?

You might try increasing the value of "result_workers" in the meantime. It looks specifically like it's the check_results queue that is hung up, while the hostgroup worker queue is executing properly.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Nagios Enterprises
Senior Developer
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Post by bosecorp »

I increased the number of workers, but I still see the same issue.
bheden
Product Development Manager
Posts: 179
Joined: Thu Feb 13, 2014 9:50 am
Location: Nagios Enterprises

Post by bheden »

Can you please post the contents of /etc/mod_gearman/mod_gearman_neb.conf and /var/log/gearmand/gearmand.log?
Did you increase result_workers (via result_workers in the neb configuration) or did you increase workers (via min-worker and max-worker in the worker configuration)? These are two different things.
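To make the distinction concrete, here is a rough sketch of where each setting lives. The option names come from the posts above; the worker file name and all numeric values are assumptions for illustration only, not recommended settings.

```
# /etc/mod_gearman/mod_gearman_neb.conf  (NEB module, runs inside Nagios)
# Number of threads that pull finished check results off the
# check_results queue. The value 3 is an illustrative assumption.
result_workers=3

# Worker configuration (file name assumed, e.g. mod_gearman_worker.conf)
# Lower and upper bounds on worker processes that execute checks.
# These only affect how fast checks run, not how fast results are consumed.
min-worker=5
max-worker=50
```

Changing min-worker/max-worker adds capacity for executing checks, while result_workers adds capacity for feeding results back into Nagios, so a backlog in check_results points at the latter.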
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Post by bosecorp »

Via the worker. The problem is not with the NEB module; it's with only one worker.

For some reason, today seems to be fine. I am going to monitor this worker today and tomorrow.
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Post by rkennedy »

That's good news, let us know how your testing goes.
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Post by bosecorp »

Spoke too soon.

I see the number of jobs waiting going up, but then it goes back down.
bheden
Product Development Manager
Posts: 179
Joined: Thu Feb 13, 2014 9:50 am
Location: Nagios Enterprises

Post by bheden »

Still just in reference to the results queue?

Did you ever increase result_workers in the neb configuration? If the result workers can't keep up with the incoming results for whatever reason, you need to increase that value.

Also -
"I started noticing that my gearman server is getting behind on JOBs

seems to be having a hard time keeping up"
So what is the problem exactly? Are you seeing any check latency? Are you seeing improper/stale checks come back in Nagios? Or does it just *seem* like it's taking longer than it should? Does the results worker queue usually hover around a particular value, or does it continuously increase in size (and keep increasing)?
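One way to answer the last question with data rather than impressions is to sample the queue sizes over time. The following is a minimal sketch, assuming gearmand is reachable on localhost at the default port 4730 and answers the plain-text `status` admin command with tab-separated lines (queue name, total jobs, running jobs, available workers) ending in a line containing a single dot. The function names and the sample queue names are hypothetical.

```python
import socket


def parse_status(text):
    """Parse gearmand 'status' output into {queue: (total, running, workers)}."""
    queues = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and the terminating "." line.
        if not line or line == ".":
            continue
        name, total, running, workers = line.split("\t")
        queues[name] = (int(total), int(running), int(workers))
    return queues


def waiting_jobs(queues):
    """Return jobs queued but not yet running, per queue (total - running)."""
    return {name: total - running for name, (total, running, _w) in queues.items()}


def fetch_status(host="localhost", port=4730, timeout=5.0):
    """Send 'status' to gearmand and return the raw text reply.

    Host and port are assumptions; adjust them for the server being checked.
    """
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(b"status\n")
        data = b""
        # The reply ends with a line containing only ".".
        while not data.endswith(b".\n"):
            chunk = sock.recv(4096)
            if not chunk:
                break
            data += chunk
    return data.decode("ascii", errors="replace")
```

Calling `waiting_jobs(parse_status(fetch_status()))` once a minute and logging the result would show whether check_results grows without bound or merely spikes and drains.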
bosecorp
Posts: 929
Joined: Thu Jun 26, 2014 1:00 pm

Post by bosecorp »

The issue is that sometimes I see the jobs waiting queue building up.