Safe way to add storage to clustered nodes or reboot nodes?

This support forum board is for support questions relating to Nagios Log Server, our solution for managing and monitoring critical log data.
zbarnett
Posts: 5
Joined: Tue Dec 18, 2018 4:36 pm

Safe way to add storage to clustered nodes or reboot nodes?

Post by zbarnett »

When working with Nagios Log Server clusters, is there anything I should be aware of when servicing individual nodes so that the cluster is not negatively affected? For example:

1) When adding storage to an individual node, can I simply perform the usual steps for adding storage and expanding a logical volume in a Linux server, as long as I only touch one node at a time, or are there special steps or considerations I should follow to avoid breaking the clustering?

2) Likewise, when patching and rebooting an individual node, is it safe to do so as long as I only bring down one node at a time and allow it to rejoin the cluster before bringing down the next node, or are there other steps that I should be aware of?
ssax
Dreams In Code
Posts: 7682
Joined: Wed Feb 11, 2015 12:54 pm

Re: Safe way to add storage to clustered nodes or reboot nod

Post by ssax »

Before modifying the nodes, I would make sure that you have snapshots & maintenance set up, that your log data is being backed up to a snapshot repository, and that you are taking Log Server backups, just in case something unexpected occurs. You can see here:

https://support.nagios.com/kb/article.php?id=68

Make sure the cluster health is green before you start doing node maintenance.
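
A minimal sketch of checking that from the shell, assuming Elasticsearch answers on localhost:9200 (the same address used by the curl commands elsewhere in this thread). The function names health_status and cluster_is_green are made up for illustration:

```shell
# Extract the "status" field from a cluster health JSON document on stdin.
health_status() {
    sed -n 's/.*"status"[[:space:]]*:[[:space:]]*"\([a-z]*\)".*/\1/p' | head -n 1
}

# Succeed only if the local node reports the cluster as green.
# Assumption: Elasticsearch is reachable on localhost:9200.
cluster_is_green() {
    [ "$(curl -s 'localhost:9200/_cluster/health' | health_status)" = "green" ]
}
```

You could then run something like `cluster_is_green && echo "OK to start maintenance"` on any node before touching it.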

Then, on the node you're working on, I would stop logstash and elasticsearch before doing anything:
- While not necessary, it can help prevent contention issues when you're interacting with the disks

Code:

systemctl stop logstash
systemctl stop elasticsearch
Then do your changes and start them back up after:

Code:

systemctl start logstash
systemctl start elasticsearch
For both 1) and 2), the only thing I would add is that after you're done and you bring the node (or its services) back up, wait until your cluster status turns green before moving on to the next node so everything gets synced up properly.
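
If you want to script that wait, here is a rough sketch. It assumes Elasticsearch is on localhost:9200, and the function names, retry count, and delay are arbitrary choices for illustration rather than anything from Nagios documentation:

```shell
# Fetch the cluster health document from the local node.
# Assumption: Elasticsearch is reachable on localhost:9200.
fetch_health() {
    curl -s 'localhost:9200/_cluster/health'
}

# Poll until the cluster reports green or the attempts run out.
# Arguments: max attempts (default 60), seconds between attempts (default 10).
wait_for_green() {
    attempts="${1:-60}"
    delay="${2:-10}"
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # A failed curl (node still starting) simply counts as "not green yet".
        if fetch_health | grep -q '"status"[[:space:]]*:[[:space:]]*"green"'; then
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    echo "cluster did not reach green after $attempts checks" >&2
    return 1
}
```

For example, `wait_for_green 120 15` after starting the services would check every 15 seconds for up to 30 minutes before giving up, and you would only move to the next node if it returns successfully.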
cdienger
Support Tech
Posts: 5045
Joined: Tue Feb 07, 2017 11:26 am

Re: Safe way to add storage to clustered nodes or reboot nod

Post by cdienger »

Welcome to the forums, @zbarnett!

In both instances I would go with working on one node at a time, as you've indicated. The only thing I would add would be to disable shard allocation first and enable it again after you've done the upgrade/maintenance/etc. The commands to disable/enable shard allocation are found in our upgrade guide at https://assets.nagios.com/downloads/nag ... Server.pdf:

Code:

# Before maintenance: disable shard allocation
curl -XPUT localhost:9200/_cluster/settings -d '{"transient":{"cluster.routing.allocation.enable":"none"}}'
# After maintenance: re-enable shard allocation
curl -XPUT localhost:9200/_cluster/settings -d '{"transient":{"cluster.routing.allocation.enable":"all"}}'
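
If you end up repeating this on every node, the two calls can be wrapped in a small helper so the JSON body isn't retyped each time. This is only a sketch: the function names are hypothetical, and it assumes the same localhost:9200 endpoint and request body as the commands above:

```shell
# Build the JSON body for the cluster settings request.
# Only the two values used in this thread ("none" and "all") are accepted.
allocation_payload() {
    case "$1" in
        none|all)
            printf '{"transient":{"cluster.routing.allocation.enable":"%s"}}' "$1"
            ;;
        *)
            echo "usage: allocation_payload none|all" >&2
            return 1
            ;;
    esac
}

# Send the setting to the local node.
# Assumption: Elasticsearch is reachable on localhost:9200.
set_allocation() {
    body="$(allocation_payload "$1")" || return 1
    curl -s -XPUT 'localhost:9200/_cluster/settings' -d "$body"
}
```

Usage would be `set_allocation none` before stopping the services and `set_allocation all` once the node is back up.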