
Issue with Nagios Worker Server.

Posted: Sat Jun 05, 2021 2:17 pm
by tchandra
Hi Team,

We are using Nagios XI Server version 5.7.3 with two worker servers (one for DC1 and one for DC2).
One of our teammates patched a server. After the patching, we observed that all DC2 hosts and services are flapping (UP and DOWN) with the error message "(service check orphaned, is the mod-gearman worker on queue 'hostgroup_SITE-DC2' running?)". The NSClient agents are also stopping on the servers, and the DB services are going into an UNKNOWN state. We also observed that the mod-gearman service was not running after the server reboot.
Every time checks run, the load and I/O on the worker server become high.

Can someone please advise on this?

Regards,
Chandra

Re: Issue with Nagios Worker Server.

Posted: Mon Jun 07, 2021 11:25 am
by ssax
Server patching on the XI server or the workers? You mean OS level patching?

Please PM me a copy of your profile from your XI server. You can download it from Admin > System Profile by clicking the Download Profile button.

What is the output of this command on BOTH the XI server and on the worker that is failing?

Code:

rpm -qa | grep -i gearm
What do your gearman worker logs show in /var/log/mod_gearman?
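
For anyone gathering the same information, here is a minimal sketch that runs the package query above and summarizes error lines from the worker logs. It rests on a few assumptions: the helper name count_log_errors and the RUN_REPORT switch are hypothetical, and the files under /var/log/mod_gearman are assumed to end in .log, which may not match your install.

```shell
#!/bin/sh
# count_log_errors FILE
# Hypothetical helper: prints how many lines in FILE mention "error"
# (case-insensitive). It is not part of mod_gearman or Nagios XI.
count_log_errors() {
    grep -ci 'error' "$1"
}

# The report only runs when RUN_REPORT=1, so the helper can be sourced alone.
if [ "${RUN_REPORT:-0}" = "1" ]; then
    # Same package query as requested in the post above.
    rpm -qa | grep -i gearm

    # Assumption: log files in this directory use a .log suffix.
    for f in /var/log/mod_gearman/*.log; do
        [ -f "$f" ] || continue
        printf '%s: %s error lines\n' "$f" "$(count_log_errors "$f")"
    done
fi
```

Run it on both the XI server and the failing worker, for example as RUN_REPORT=1 sh report.sh (the script name is arbitrary), and compare the package versions between the two.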

Re: Issue with Nagios Worker Server.

Posted: Thu Jun 10, 2021 2:19 am
by tchandra
Hi Sean,

Thanks for your response.

Yes, we did OS-level patching. We observed that the load on the worker server was getting heavy while checks were running, so we rebooted the worker server and restarted the gearman service. The issue resolved itself after some time, but we were unable to find the root cause.


Regards,
Chandra

Re: Issue with Nagios Worker Server.

Posted: Thu Jun 10, 2021 4:34 pm
by benjaminsmith
Hi,
"Anyhow issue got resolved after sometime, but unable to found the root cause."
We can keep this open for now and if the issue comes up, please send us the system profile to review.
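
If the problem comes back, a quick check along these lines on the worker may help narrow things down before sending the profile. This is only a sketch under stated assumptions: the unit name mod-gearman-worker is a guess that depends on the installed package (override it with WORKER_UNIT), load_is_high and RUN_CHECK are hypothetical names, and "load above CPU count" is a rough rule of thumb rather than a Nagios threshold.

```shell
#!/bin/sh
# load_is_high LOAD CPUS
# Hypothetical helper: succeeds (exit 0) when LOAD is numerically
# greater than CPUS, and fails otherwise.
load_is_high() {
    awk -v load="$1" -v cpus="$2" 'BEGIN { exit !(load > cpus) }'
}

# The check only runs when RUN_CHECK=1, so the helper can be sourced alone.
if [ "${RUN_CHECK:-0}" = "1" ]; then
    # Assumption: the worker's systemd unit name; verify it on your system.
    unit="${WORKER_UNIT:-mod-gearman-worker}"

    # Is the worker running now, and will it start after a reboot?
    systemctl is-active "$unit"
    systemctl is-enabled "$unit"

    # Compare the 1-minute load average with the number of CPUs.
    load=$(cut -d' ' -f1 /proc/loadavg)
    cpus=$(nproc)
    if load_is_high "$load" "$cpus"; then
        echo "load $load exceeds $cpus CPUs"
    fi
fi
```

A unit that is not reported as enabled would be consistent with the worker not running after the reboot described earlier in the thread, though that alone would not explain the high load.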