Status red with all nodes thinking it is the master
Posted: Wed Aug 12, 2015 3:33 pm
I am not sure what happened, but the amount of incoming logs dramatically decreased. So I started investigating and noticed that the GUI was running really slow. I tried running commands via the console and that also was really slow. Eventually now when I checked the cluster status, it is in red and reporting 1 node only. Checking each node, they are now reporting that they are the master with only 1 node showing. Not really show what happened, but really need to get this cluster back up and running, but wanted to make sure we bring this back without causing any split brain issues. Please help, thanks.
Node 1
# curl 'localhost:9200/_cat/master?v'
id host ip node
meXRK6XITBO6x_Mfgju6yw node1 10.242.102.107 4521585a-88af-47c9-81e5-c4d13cffb148
# curl -XGET 'http://localhost:9200/_cluster/health?pretty'
{
"cluster_name" : "80e9022e-f73f-429e-8927-f23d0d88dfd2",
"status" : "red",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 56,
"active_shards" : 56,
"relocating_shards" : 0,
"initializing_shards" : 1,
"unassigned_shards" : 115
}
Node 2
# curl 'localhost:9200/_cat/master?v'
id host ip node
4Ctq93IFT3WHVqK3Mo5VeQ node2 10.242.102.108 30ab2b2c-439f-4bcc-977d-7c0e9a90f3a5
# curl -XGET 'http://localhost:9200/_cluster/health?pretty'
{
"cluster_name" : "80e9022e-f73f-429e-8927-f23d0d88dfd2",
"status" : "red",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 59,
"active_shards" : 59,
"relocating_shards" : 0,
"initializing_shards" : 0,
"unassigned_shards" : 113
}
Node 3
# curl 'localhost:9200/_cat/master?v'
id host ip node
TPLs_kcbQca8OeYwAxlANg node3 10.242.102.109 e63648a3-d912-4f5d-a867-1b99282a5e7c
# curl -XGET 'http://localhost:9200/_cluster/health?pretty'
{
"cluster_name" : "80e9022e-f73f-429e-8927-f23d0d88dfd2",
"status" : "red",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 58,
"active_shards" : 58,
"relocating_shards" : 0,
"initializing_shards" : 0,
"unassigned_shards" : 114
}
Node 1
# curl 'localhost:9200/_cat/master?v'
id host ip node
meXRK6XITBO6x_Mfgju6yw node1 10.242.102.107 4521585a-88af-47c9-81e5-c4d13cffb148
# curl -XGET 'http://localhost:9200/_cluster/health?pretty'
{
"cluster_name" : "80e9022e-f73f-429e-8927-f23d0d88dfd2",
"status" : "red",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 56,
"active_shards" : 56,
"relocating_shards" : 0,
"initializing_shards" : 1,
"unassigned_shards" : 115
}
Node 2
# curl 'localhost:9200/_cat/master?v'
id host ip node
4Ctq93IFT3WHVqK3Mo5VeQ node2 10.242.102.108 30ab2b2c-439f-4bcc-977d-7c0e9a90f3a5
# curl -XGET 'http://localhost:9200/_cluster/health?pretty'
{
"cluster_name" : "80e9022e-f73f-429e-8927-f23d0d88dfd2",
"status" : "red",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 59,
"active_shards" : 59,
"relocating_shards" : 0,
"initializing_shards" : 0,
"unassigned_shards" : 113
}
Node 3
# curl 'localhost:9200/_cat/master?v'
id host ip node
TPLs_kcbQca8OeYwAxlANg node3 10.242.102.109 e63648a3-d912-4f5d-a867-1b99282a5e7c
# curl -XGET 'http://localhost:9200/_cluster/health?pretty'
{
"cluster_name" : "80e9022e-f73f-429e-8927-f23d0d88dfd2",
"status" : "red",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 58,
"active_shards" : 58,
"relocating_shards" : 0,
"initializing_shards" : 0,
"unassigned_shards" : 114
}