Gearman Data Retention in Collectors

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
rajasegar
Posts: 1018
Joined: Sun Mar 30, 2014 10:49 pm

Gearman Data Retention in Collectors

Post by rajasegar »

XI 2014R1.2

Code: Select all

[nagios@nagiosprodxi1 libexec]$ /usr/bin/check_gearman -V
check_gearman: version 1.4_nagios4 running on libgearman 0.25
I would like to know how long does the Gearman collectors retain the monitoring data if there is connectivity problem with the Nagios XI host?
Does gearman collectors send all the collected monitoring data that is pending submission once connectivity is Ok?

Where is this configured?


Thanks.
5 x Nagios 5.6.9 Enterprise Edition
RHEL 6 & 7
rrdcached & ramdisk optimisation
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Gearman Data Retention in Collectors

Post by slansing »

I believe it holds a socket connection open until the timeout is either cleared, or reaches it's max elapsed time. Once either of these are met, the data is released. I can talk to some co-workers that may know a bit more about it's protocols when it comes to this, but as far as I know, there is not a lot behind the scenes, it's either sent to Nagios, or dropped when the timeout int is reached, I don't think there is any caching.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Gearman Data Retention in Collectors

Post by abrist »

I do not believe gearman "caches" check results. As it is a distributed setup, if one worker goes down and the checks distributed to it are not returned, the master gearman daemon will just reschedule the checks on another worker.
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
rajasegar
Posts: 1018
Joined: Sun Mar 30, 2014 10:49 pm

Re: Gearman Data Retention in Collectors

Post by rajasegar »

abrist wrote:I do not believe gearman "caches" check results. As it is a distributed setup, if one worker goes down and the checks distributed to it are not returned, the master gearman daemon will just reschedule the checks on another worker.

The collector will be at a remote site across the WAN.
So if there is any connection issue with between the collector and the Nagios XI host all checks will timeout and effectively all monitoring will be down for the site, right?
5 x Nagios 5.6.9 Enterprise Edition
RHEL 6 & 7
rrdcached & ramdisk optimisation
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Gearman Data Retention in Collectors

Post by slansing »

Essentially, yes, just as if you were checking a remote host by making a direct connection to an NRPE agent, and if you lost connectivity there, those checks would start timing out until the connection resumed.

If you can't connect to an address properly, monitoring will not be possible until a connection is made.
rajasegar
Posts: 1018
Joined: Sun Mar 30, 2014 10:49 pm

Re: Gearman Data Retention in Collectors

Post by rajasegar »

slansing wrote:Essentially, yes, just as if you were checking a remote host by making a direct connection to an NRPE agent, and if you lost connectivity there, those checks would start timing out until the connection resumed.

If you can't connect to an address properly, monitoring will not be possible until a connection is made.
Is there a way to setup an independent XI in a remote site which syncs its data to the Central XI?
5 x Nagios 5.6.9 Enterprise Edition
RHEL 6 & 7
rrdcached & ramdisk optimisation
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Gearman Data Retention in Collectors

Post by slansing »

Absolutely!

You can follow a couple short documents to set it up on both servers. The XI system in that network can run all of that network's checks itself, and the simply mirror and push the results up to the central XI server.

Internal: http://assets.nagios.com/downloads/nagi ... ith_XI.pdf

Central: http://assets.nagios.com/downloads/nagi ... ith_XI.pdf
rajasegar
Posts: 1018
Joined: Sun Mar 30, 2014 10:49 pm

Re: Gearman Data Retention in Collectors

Post by rajasegar »

slansing wrote:Absolutely!

You can follow a couple short documents to set it up on both servers. The XI system in that network can run all of that network's checks itself, and the simply mirror and push the results up to the central XI server.

Internal: http://assets.nagios.com/downloads/nagi ... ith_XI.pdf

Central: http://assets.nagios.com/downloads/nagi ... ith_XI.pdf
Some Questions
1) Can I confirm multiple internal servers can sync to the central server?
2) Where does the alerts notifications get generated? The internal or central server?
3) Does this work with Nagios Core internal servers as well?
4) Any potential issues with this type of setup?
5 x Nagios 5.6.9 Enterprise Edition
RHEL 6 & 7
rrdcached & ramdisk optimisation
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Gearman Data Retention in Collectors

Post by abrist »

rajasegar wrote:1) Can I confirm multiple internal servers can sync to the central server?
We need to clarify this. Pushing checks to a central server will not "sync". They are literally pushed. If the central server is not reachable for a period of time, any checks run by a remote XI server will not get pushed to the central server.
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
rajasegar
Posts: 1018
Joined: Sun Mar 30, 2014 10:49 pm

Re: Gearman Data Retention in Collectors

Post by rajasegar »

abrist wrote:
rajasegar wrote:1) Can I confirm multiple internal servers can sync to the central server?
We need to clarify this. Pushing checks to a central server will not "sync". They are literally pushed. If the central server is not reachable for a period of time, any checks run by a remote XI server will not get pushed to the central server.
I am looking to sync (update) the check data to the central XI.
Remote server can decide not to push anymore? It should be able to resume when connection is established.
5 x Nagios 5.6.9 Enterprise Edition
RHEL 6 & 7
rrdcached & ramdisk optimisation
Locked