
Follow-up work for desired balance allocator #91386

@DaveCTurner

Allocator behavior

  • Exclude from the desired balance any shard that can neither remain on its current node nor move anywhere else. This does not hurt anything at the moment, but it could be very surprising to see a node ID in a desired set when any of the deciders returns NO for it.
  • Minimize shard movements when balancing the cluster during the compute phase.
  • Ensure ClusterInfoSimulator does not diverge much from the real ClusterInfo after shards are relocated, since accumulated error could result in poor assignments during computation.
  • Address apparent preference to concentrate large shards on some nodes and small shards on others
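
To make the first item above concrete, here is a minimal sketch of the filtering rule. It is not Elasticsearch code: `ShardDecisions`, `is_stuck`, and `exclude_stuck` are hypothetical names, and the decider results are assumed to be already summarized as a boolean (can the shard remain?) plus a set of nodes it is allowed to move to.

```python
from dataclasses import dataclass, field


@dataclass
class ShardDecisions:
    """Hypothetical per-shard summary of allocation decider results."""
    shard_id: str
    current_node: str
    can_remain: bool  # False if any decider returns NO for the current node
    allowed_targets: set = field(default_factory=set)  # nodes the deciders would accept


def is_stuck(shard):
    """A shard is 'stuck' if it can neither stay where it is nor move elsewhere."""
    return not shard.can_remain and not shard.allowed_targets


def exclude_stuck(desired, shards):
    """Return a copy of the desired balance without entries for stuck shards.

    `desired` maps shard id -> set of desired node ids (an assumed shape).
    """
    stuck_ids = {s.shard_id for s in shards if is_stuck(s)}
    return {sid: nodes for sid, nodes in desired.items() if sid not in stuck_ids}
```

With this rule, a shard whose current node is rejected by a decider and which has no valid target simply has no desired set, rather than listing a node that a decider said NO to.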

Stats

  • Compute the fraction of shards allocated on fallback nodes. A high value indicates that the cluster is ignoring the desired assignments, which will result in more shard movements in the future.
  • Measure the average time between routing table changes (such as index additions, deletions, and settings changes) that require a shard movement. Measure the average time to move a shard.
  • Convert DesiredBalanceShardsAllocator metrics to proper APM metrics so that they can be observed over time.
  • Desired-balance warn threshold logging should accumulate across restarts #100850
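
As an illustration of the first stat, the fraction could be computed roughly as follows. This is a sketch under assumptions, not the allocator's implementation: it assumes "fallback node" means any node outside a shard's desired set, and `fallback_fraction` and both map shapes are hypothetical.

```python
def fallback_fraction(desired, current):
    """Fraction of assigned shards that sit on a node outside their desired set.

    desired: shard id -> set of desired node ids (assumed shape)
    current: shard id -> node id, or None if the shard is unassigned
    """
    # Unassigned shards are ignored: they are not on any node, fallback or otherwise.
    assigned = [(sid, node) for sid, node in current.items() if node is not None]
    if not assigned:
        return 0.0
    on_fallback = sum(
        1 for sid, node in assigned if node not in desired.get(sid, set())
    )
    return on_fallback / len(assigned)
```

A value near 0 would mean the cluster closely follows the desired balance; a value near 1 would mean most shards are placed elsewhere and will likely need to move later.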

API improvements

Other

Labels

  • :Distributed Coordination/Allocation: All issues relating to the decision making around placing a shard (both master logic & on the nodes)
  • Meta
  • Team:Distributed (Obsolete): Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination.
  • Team:Distributed Coordination: Meta label for Distributed Coordination team
