Conversation

@hanbj (Contributor) commented May 31, 2019

The previous implementation was well thought out, but a rebalance triggered for one index causes all indices to be rebalanced together.

@matriv added the :Distributed Coordination/Allocation label May 31, 2019
@elasticmachine (Collaborator)

Pinging @elastic/es-distributed

@ywelsch self-requested a review June 4, 2019 14:59
@ywelsch (Contributor) left a comment

Thank you for your interest in contributing to ES. I don't understand what this change is trying to achieve, unfortunately. Can you provide some more explanations? In particular, why is it removing the code that allows index-level settings from overriding the cluster-level ones?

@hanbj (Contributor, Author) commented Jun 6, 2019

If an index is allowed to rebalance, then the check "if (allocation.deciders().canRebalance(allocation).type() != Type.YES)" evaluates to false and the code continues to execute downward. In the balanceByWeights() method, all indices are then traversed and rebalanced according to weight, so setting cluster.routing.rebalance.enable: "none" has no effect.
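
As a rough illustration of that flow, here is a minimal, self-contained sketch; DecisionType and the boolean parameter are stand-ins rather than the actual Elasticsearch classes, and only the canRebalance gate and the balanceByWeights() step mirror the names quoted above.

// Minimal sketch of the described flow: one cluster-wide gate, then all indices are balanced.
enum DecisionType { YES, THROTTLE, NO }

class RebalanceFlowSketch {
    // stand-in for allocation.deciders().canRebalance(allocation).type()
    static DecisionType canRebalanceClusterWide(boolean anyIndexAllowsRebalance) {
        return anyIndexAllowsRebalance ? DecisionType.YES : DecisionType.NO;
    }

    static void rebalance(boolean anyIndexAllowsRebalance) {
        // if the single cluster-wide gate returns YES, execution continues downward ...
        if (canRebalanceClusterWide(anyIndexAllowsRebalance) != DecisionType.YES) {
            return;
        }
        // ... and balanceByWeights() then traverses every index, not only the one
        // whose settings allowed rebalancing
        System.out.println("balanceByWeights(): rebalancing all indices by weight");
    }

    public static void main(String[] args) {
        rebalance(true);
    }
}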

@ywelsch (Contributor) left a comment

I'm not sure I understand what the purpose of this PR is, especially as it's disabling existing ES functionality. Is it to address a scaling issue? You mention a cluster with 300,000 shards on another PR, so I assume this is related. Why is this cluster making use of index-level rebalancing (i.e. the index.routing.rebalance.enable setting) if that is turning out to be a scaling bottleneck?
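
For reference, the cluster-wide and per-index forms of this setting look roughly as follows (illustrative sketch only; the values are arbitrary examples).

// Illustrative only: the two rebalance.enable setting keys being discussed.
import org.elasticsearch.common.settings.Settings;

class RebalanceSettingsExample {
    public static void main(String[] args) {
        // cluster-level default: disable rebalancing everywhere
        Settings clusterSettings = Settings.builder()
                .put("cluster.routing.rebalance.enable", "none")
                .build();
        // index-level override for a single index: rebalancing stays allowed there
        Settings indexSettings = Settings.builder()
                .put("index.routing.rebalance.enable", "all")
                .build();
        System.out.println(clusterSettings + " / " + indexSettings);
    }
}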

return allocation.decision(Decision.NO, NAME, "none rebalance are not allowed");
case PRIMARIES:
if (allocation.routingNodes().hasInactivePrimaries()) {
return allocation.decision(Decision.NO, NAME,

Contributor

why does this globally disable rebalancing when there are some inactive primaries of an unrelated index?
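
For illustration, a minimal, self-contained sketch of the concern; the Shard record and helper below are hypothetical stand-ins for the RoutingNodes bookkeeping.

// Sketch: the inactive-primaries check is cluster-wide, so a single recovering primary
// of an unrelated index flips the result for everything.
import java.util.List;

class InactivePrimariesSketch {
    record Shard(String index, boolean primary, boolean active) {}

    // stand-in for allocation.routingNodes().hasInactivePrimaries()
    static boolean hasInactivePrimaries(List<Shard> shards) {
        return shards.stream().anyMatch(s -> s.primary() && !s.active());
    }

    public static void main(String[] args) {
        List<Shard> shards = List.of(
                new Shard("logs-2019", true, true),
                new Shard("metrics", true, false)); // unrelated index, still recovering
        // under the proposed PRIMARIES branch this single inactive primary would turn
        // the decision into Decision.NO for the whole cluster
        System.out.println(hasInactivePrimaries(shards)); // prints: true
    }
}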

return allocation.decision(Decision.YES, NAME, "all primary shards is active and rebalance are allowed");
case REPLICAS:
if (allocation.routingNodes().hasInactiveShards()) {
return allocation.decision(Decision.NO, NAME,

Contributor

why does this globally disable rebalancing when there are inactive shards of some unrelated index?

case ALL:
return allocation.decision(Decision.YES, NAME, "all rebalance are allowed");
case NONE:
return allocation.decision(Decision.NO, NAME, "none rebalance are not allowed");

Contributor

As said earlier, this changes the behavior of ES so that the index-level property is no longer taken into account when the cluster-level property is set.
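
For comparison, a rough sketch of the index-over-cluster fallback that the change removes; this is hypothetical stand-in code, as the real decider reads INDEX_ROUTING_REBALANCE_ENABLE_SETTING and CLUSTER_ROUTING_REBALANCE_ENABLE_SETTING through the Settings infrastructure rather than a plain map.

// Sketch: the index-level value, when present, wins over the cluster-level one.
import java.util.Map;
import java.util.Optional;

class EnableOverrideSketch {
    enum Rebalance { ALL, PRIMARIES, REPLICAS, NONE }

    static Rebalance effectiveRebalance(Map<String, Rebalance> perIndexSetting,
                                        String index, Rebalance clusterLevel) {
        // index.routing.rebalance.enable, if set, overrides cluster.routing.rebalance.enable
        return Optional.ofNullable(perIndexSetting.get(index)).orElse(clusterLevel);
    }

    public static void main(String[] args) {
        Map<String, Rebalance> perIndex = Map.of("hot-index", Rebalance.NONE); // hypothetical index name
        System.out.println(effectiveRebalance(perIndex, "hot-index", Rebalance.ALL));   // NONE
        System.out.println(effectiveRebalance(perIndex, "other-index", Rebalance.ALL)); // ALL
    }
}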

@ywelsch (Contributor) commented Jul 1, 2019

I don't see a way to take this PR forward (as it is breaking existing functionality), which is why I'm closing this. @hanbj do let me know about the problem this is trying to address and in particular whether it is something that would be covered by other work (e.g. #42738)

@ywelsch closed this Jul 1, 2019
@hanbj deleted the enabled branch May 29, 2020 06:55