Remove node-level canAllocate override #59389

DaveCTurner · 2020-07-13T09:33:56Z

Today there is a node-level canAllocate override which the balancer
uses to ignore certain nodes to which it is certain no more shards can
be allocated. In fact this override only ignores nodes which have hit
the rarely-used cluster.routing.allocation.total_shards_per_node
limit, so this optimization doesn't have a meaningful impact on real
clusters.

This commit removes this unnecessary fast path from the balancer, and
also removes all the machinery needed to support it.

Today there is a node-level `canAllocate` override which the balancer uses to ignore certain nodes to which it is certain no more shards can be allocated. In fact this override only ignores nodes which have hit the rarely-used `cluster.routing.allocation.total_shards_per_node` limit, so this optimization doesn't have a meaningful impact on real clusters. This commit removes this unnecessary fast path from the balancer, and also removes all the machinery needed to support it.

elasticmachine · 2020-07-13T09:33:58Z

Pinging @elastic/es-distributed (:Distributed/Allocation)

DaveCTurner · 2020-07-13T09:35:22Z

@zuketo do we have any data that can show how rarely cluster.routing.allocation.total_shards_per_node is actually used? I couldn't see an obvious way to determine that from any telemetry.

zuketo · 2020-07-14T21:38:10Z

I couldn't find any good data sources for this (other than adding to telemetry). This setting is also not whitelisted by cloud, so no data points there. Could we deprecate first and then look at removal?

DaveCTurner · 2020-07-15T07:49:13Z

Thanks for confirming, Jason. TBC we're not talking about removing the setting itself, only the 100 lines of code that treats this setting as a special case in the shard allocator. Let's see what the team discussion brings.

DaveCTurner · 2020-07-22T13:58:41Z

Absent a better source of data I looked through as many user interactions as I could find and only encountered 7 mentions of this setting in the last 90 days, and I don't think any of them would have benefitted from this optimisation. We discussed this today as well and agreed to proceed.

ywelsch

LGTM

Today there is a node-level `canAllocate` override which the balancer uses to ignore certain nodes to which it is certain no more shards can be allocated. In fact this override only ignores nodes which have hit the rarely-used `cluster.routing.allocation.total_shards_per_node` limit, so this optimization doesn't have a meaningful impact on real clusters. This commit removes this unnecessary fast path from the balancer, and also removes all the machinery needed to support it.

DaveCTurner added :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) >refactoring team-discuss v8.0.0 v7.9.0 labels Jul 13, 2020

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Jul 13, 2020

DaveCTurner mentioned this pull request Jul 13, 2020

Optimize variable in unassigned shards allocation logic. #59350

Closed

DaveCTurner requested a review from ywelsch July 22, 2020 13:58

DaveCTurner removed the team-discuss label Jul 22, 2020

ywelsch approved these changes Jul 22, 2020

View reviewed changes

DaveCTurner merged commit c1274a4 into elastic:master Jul 23, 2020

DaveCTurner deleted the 2020-07-13-remove-node-level-canAllocate branch July 23, 2020 07:48

DaveCTurner restored the 2020-07-13-remove-node-level-canAllocate branch July 23, 2020 07:48

DaveCTurner deleted the 2020-07-13-remove-node-level-canAllocate branch July 23, 2020 07:48

howardhuanghua mentioned this pull request Sep 7, 2020

Remove unsed deciders in BalancedShardsAllocator. #62026

Merged

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove node-level canAllocate override #59389

Remove node-level canAllocate override #59389

Uh oh!

DaveCTurner commented Jul 13, 2020

Uh oh!

elasticmachine commented Jul 13, 2020

Uh oh!

DaveCTurner commented Jul 13, 2020

Uh oh!

zuketo commented Jul 14, 2020

Uh oh!

DaveCTurner commented Jul 15, 2020

Uh oh!

DaveCTurner commented Jul 22, 2020

Uh oh!

ywelsch left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Remove node-level canAllocate override #59389

Remove node-level canAllocate override #59389

Uh oh!

Conversation

DaveCTurner commented Jul 13, 2020

Uh oh!

elasticmachine commented Jul 13, 2020

Uh oh!

DaveCTurner commented Jul 13, 2020

Uh oh!

zuketo commented Jul 14, 2020

Uh oh!

DaveCTurner commented Jul 15, 2020

Uh oh!

DaveCTurner commented Jul 22, 2020

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants