Follow-up work for desired balance allocator

## Allocator behavior

- [ ] Exclude from the desired balance any shards that can neither stay on their current node nor move anywhere else. This is harmless at the moment, but it could be very surprising to see a node ID in a desired set when one of the deciders returns NO for it.
- [ ] Minimize shard movements when balancing the cluster during the compute phase.
- [ ] Ensure `ClusterInfoSimulator` does not diverge much from the real `ClusterInfo` after shards are relocated, as accumulated error could result in poor assignments during computation.
- [ ] Address the apparent preference to concentrate large shards on some nodes and small shards on others.

## Stats

- [ ] Compute the fraction of shards allocated on fallback nodes. A high value indicates that the cluster is ignoring assignments, which will result in more shard movements in the future.
- [ ] Measure the average time between routing table changes that require a shard movement, such as index additions, index deletions and settings changes. Measure the average time to move a shard.
- [ ] Convert `DesiredBalanceShardsAllocator` metrics to proper APM metrics so that they can be observed over time.
- [x] https://github.com/elastic/elasticsearch/issues/100850
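
A rough sketch of the fallback-node fraction from the first item above, in plain Java rather than against the real Elasticsearch classes. `ShardPlacement` and `fractionOnFallbackNodes` are hypothetical names invented for illustration. The sketch also assumes that any assigned copy sitting outside its desired set counts as being on a fallback node, which would include copies that are simply waiting to be moved.

```java
import java.util.List;
import java.util.Set;

public class FallbackFraction {

    // Hypothetical view of one assigned shard copy: where it is now and
    // which nodes the desired balance wants it on.
    record ShardPlacement(String shardId, String currentNodeId, Set<String> desiredNodeIds) {}

    // Fraction of assigned shard copies whose current node is not in their desired set.
    static double fractionOnFallbackNodes(List<ShardPlacement> placements) {
        if (placements.isEmpty()) {
            return 0.0; // nothing is assigned, so nothing is on a fallback node
        }
        long onFallback = placements.stream()
            .filter(p -> p.desiredNodeIds().contains(p.currentNodeId()) == false)
            .count();
        return (double) onFallback / placements.size();
    }

    public static void main(String[] args) {
        List<ShardPlacement> placements = List.of(
            new ShardPlacement("index-a[0]", "node-1", Set.of("node-1", "node-2")),
            new ShardPlacement("index-a[1]", "node-3", Set.of("node-1", "node-2"))
        );
        System.out.println(fractionOnFallbackNodes(placements)); // prints 0.5
    }
}
```

A real metric would probably need to distinguish copies that are in the middle of a planned relocation from copies that were placed on a fallback node because no desired node accepted them.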

## API improvements

- [ ] Add forecasts (ingest and disk) to `/_cat/allocation` and the node stats API (https://github.com/elastic/elasticsearch/pull/97561)
- [ ] Expose desired nodes in the `/_cluster/allocation/explain` API
- [ ] Report both node id and node name in `/_internal/desired_balance` for current and desired nodes.
- [ ] Expose balancing metrics over the node stats API (https://github.com/elastic/elasticsearch/issues/92097)

## Other

- [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015250509: extract common code from `ResizeAllocationDecider#canAllocate` and `ResizeAllocationDecider#getForcedInitialShardAllocationToNodes`.
- [x] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015376521: confirm whether it's enough to rely on the invariants of `RoutingNodes` to protect against assigning multiple copies of a shard to a node.
- [x] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015387753: confirm whether the new comment is sufficient
- [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015436861: is `findLatest` doing the right thing?
- [x] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015625531: is `PendingListenersQueue#completeAllAsNotMaster()` safe? (#91428)
- [x] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1016235756: `ContinuousComputation` has lacklustre rejection handling (#91442)
- [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1016240398: make explicit the assumption that `onNewInput` is called in order with increasing indices (#91443)
- [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1016254823: do we want to bail out on all empty balances or just `INITIAL`?
- [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1016269907: should we also use `canAllocate(shard, allocation)` to short-circuit cases where a shard cannot be assigned anywhere?
- [ ] Possible improvements to code copied from existing implementation:
  - [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015444100: should `failAllocationOfNewPrimaries` check the recovery source?
  - [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015445384: use `!=` instead of `^`.
  - [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015449380: `.compareTo() == 0` vs `.equals()`
  - [ ] https://github.com/elastic/elasticsearch/pull/91343#discussion_r1015463215: comparator efficiency
- [ ] Update `wait_for_no_initializing_shards` and `wait_for_no_relocating_shards` so that they do not return immediately while a desired balance computation is ongoing, as the computation might cause shards to start initializing or relocating.
- [x] Run the computation with a limited `cluster_concurrent_rebalance` to avoid unnecessary shard movements during the desired balance computation (https://github.com/elastic/elasticsearch/pull/93977)
- [x] Clean up the desired balance once a new master is elected (https://github.com/elastic/elasticsearch/pull/95450)
- [x] Allow the desired balance to be manually reset or recomputed from scratch (https://github.com/elastic/elasticsearch/pull/94525)
- [x] Automatically detect (and log) when the desired balance deviates too much from the current state (by a configured fraction of shards) (https://github.com/elastic/elasticsearch/pull/95458)
- [x] Node shutdown may get stuck if the desired balance computation adds a further move (B -> C) after a node replacement move (A -> B), because `NodeReplacementAllocationDecider` would not permit the direct move (A -> C) (https://github.com/elastic/elasticsearch/pull/95070)
- [x] When the desired balance has diverged significantly from the current balance, prioritize shard movements that improve the balance (away from fuller nodes, towards emptier nodes) (https://github.com/elastic/elasticsearch/pull/95454)
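
For the two style items above (`!=` instead of `^`, and `.compareTo() == 0` vs `.equals()`), here is a small self-contained Java illustration of why the distinction can matter. The review threads are not reproduced here, so the types involved in the real code are unknown; `BigDecimal` is used only because it is a well-known standard-library class where the two comparisons disagree, and the class and method names are made up for this sketch.

```java
import java.math.BigDecimal;

public class ComparisonIdioms {

    // For boolean operands, ^ and != have the same truth table;
    // != states the intent ("these two flags differ") more directly.
    static boolean differsXor(boolean a, boolean b) {
        return a ^ b;
    }

    static boolean differsNotEquals(boolean a, boolean b) {
        return a != b;
    }

    // compareTo() == 0 asks "same position in the ordering".
    static boolean sameByCompareTo(BigDecimal x, BigDecimal y) {
        return x.compareTo(y) == 0;
    }

    // equals() asks "same object state"; for BigDecimal this includes the scale.
    static boolean sameByEquals(BigDecimal x, BigDecimal y) {
        return x.equals(y);
    }

    public static void main(String[] args) {
        boolean[] values = { false, true };
        for (boolean a : values) {
            for (boolean b : values) {
                if (differsXor(a, b) != differsNotEquals(a, b)) {
                    throw new AssertionError("^ and != disagree for booleans");
                }
            }
        }
        BigDecimal x = new BigDecimal("1.0");
        BigDecimal y = new BigDecimal("1.00");
        System.out.println(sameByCompareTo(x, y)); // prints true
        System.out.println(sameByEquals(x, y));    // prints false
    }
}
```

Whether the two forms are interchangeable in the allocator code depends on whether the compared type keeps `compareTo` consistent with `equals`, which is the question those review comments raise.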

Follow-up work for desired balance allocator #91386