(Massive) Data Transfer Between Sites

This support forum board is for support questions relating to Nagios Log Server, our solution for managing and monitoring critical log data.
Locked
myriad
Posts: 26
Joined: Tue Dec 02, 2014 1:29 pm

(Massive) Data Transfer Between Sites

Post by myriad »

I have a three instance setup.
I have one server at each site.
Whenever one server is rebooted, we're seeing loads of traffic transfer from site to site.
Why is this happening?
Am I setup wrong?
All three log servers will need to be at a single site?
If I want coverage at the three sites, will I have to separate the servers so that they will not be in a cluster?
User avatar
lgroschen
Posts: 384
Joined: Wed Nov 27, 2013 1:17 pm

Re: (Massive) Data Transfer Between Sites

Post by lgroschen »

I think this is intended. When you have 3+ servers in your cluster there is a redundant 'shard' of data that is allocated for each instance. So with 3 instances there will be 1 primary and 1 backup shard for a total of 6 shards.

When you bring a server down the other 2 servers have to make up for the hole in the data and use the backup shards of the downed server to be able to run queries on your User Interface so that you can see all the data that was on the downed server. The traffic you are seeing is likely the allocation of shards and jobs after the 3rd server went down.

Are you having to reboot servers often? You shouldn't need to so this may be abnormal behavior depending on why you reboot them.
/Luke
myriad
Posts: 26
Joined: Tue Dec 02, 2014 1:29 pm

Re: (Massive) Data Transfer Between Sites

Post by myriad »

I have had to reboot frequently to keep the servers up. now I'm looking at the index status over the last two days. since the 21st of January I was logging about 1.6-6GB daily. the number of reporting hosts bounces around. right now, I see 98 but over 160 should be reporting.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: (Massive) Data Transfer Between Sites

Post by scottwilkerson »

myriad wrote:Am I setup wrong?
All three log servers will need to be at a single site?
If I want coverage at the three sites, will I have to separate the servers so that they will not be in a cluster?
The servers in the cluster are in constant communication and need to be able to transfer significant data when a machine goes offline. This is expected behavior to protect your data, making sure there is always at least 1 primary and 1 replica shard for all of the log data you have.

Depending on your network infrastructure, it may be better to have multiple clusters if the data transfer between machines it problematic, however I would not recommend having any less than 2 instances in each cluster because you would not have any redundancy.
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked