Issue with Nagios Worker Server.

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
tchandra
Posts: 38
Joined: Fri Oct 18, 2019 5:53 am

Issue with Nagios Worker Server.

Post by tchandra »

Hi Team ,

we are using Nagios XI Server 5.7.3 version & we have 2 worker servers for this (one for DC1 & one for DC2).
one of team mate did server patching . After patching , we observed that All DC2 hosts & Services are flapping ( UP & DOWN ) with error message as (service check orphaned, is the mod-gearman worker on queue 'hostgroup_SITE-DC2' running?) and Nsclient agents getting stopped in the servers. and db services going to unknown state. And we observed that mod-gearman service is not running after server reboot.
Every time Worker Server load & I/O is getting high when checks are happening.

Can someone please suggest on this.

Regards,
Chandra
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Issue with Nagios Worker Server.

Post by ssax »

Server patching on the XI server or the workers? You mean OS level patching?

Please PM me a copy of your profile from your XI server, you can download it from Admin > System Profile by clicking the Download Profile button.

What is the output of this command on BOTH the XI server and on the worker that is failing?

Code: Select all

rpm -qa | grep -i gearm
What do your gearman worker logs show in /var/log/mod_gearman?
tchandra
Posts: 38
Joined: Fri Oct 18, 2019 5:53 am

Re: Issue with Nagios Worker Server.

Post by tchandra »

Hi Sean,

Thanks for your response.

Yes , we have done with OS Level patching. we observed that load was getting heavy in worker server during check happening. So rebooted the worker server and restarted the gear man service. Anyhow issue got resolved after sometime, but unable to found the root cause.


Regards,
Chandra
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Issue with Nagios Worker Server.

Post by benjaminsmith »

Hi,
Anyhow issue got resolved after sometime, but unable to found the root cause.
We can keep this open for now and if the issue comes up, please send us the system profile to review.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked