Desired-balance warn threshold logging should accumulate across restarts

Today we emit periodic INFO logs about an ongoing desired balance computation which has not converged after some amount of time or some number of iterations:

https://github.com/elastic/elasticsearch/blob/18f960c4eb83ce5fc03be3b1ea7ce985f15f2d52/server/src/main/java/org/elasticsearch/cluster/routing/allocation/allocator/DesiredBalanceComputer.java#L321-L329

However, these numbers reset whenever a new cluster state is received, so a steady stream of cluster states can prevent the computation from converging without us ever seeing any warnings. IMO we should let these numbers accumulate until the computation fully converges, and also report the number of restarts since the last convergence in the log message.

	logger.log(
	    reportByIterationCount || reportByTime ? Level.INFO : i % 100 == 0 ? Level.DEBUG : Level.TRACE,
	    () -> Strings.format(
	        "Desired balance computation for [%d] is still not converged after [%s] and [%d] iterations",
	        desiredBalanceInput.index(),
	        TimeValue.timeValueMillis(currentTime - computationStartedTime).toString(),
	        iterations
	    )
	);
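
The accumulation idea proposed above could look roughly like the sketch below. This is only an illustration, not the actual `DesiredBalanceComputer` code: `ConvergenceTracker`, its method names, and the threshold parameters are all hypothetical, and how it would be wired into the existing reporting settings is left open. The point is that the start time and iteration count survive a restart caused by a new cluster state and are cleared only on full convergence.

```java
// Hypothetical helper, NOT part of Elasticsearch: accumulates progress across
// restarted computations and resets only when a computation fully converges.
final class ConvergenceTracker {
    private boolean pending = false;   // true while no computation has converged since the first start
    private long firstStartMillis = 0; // start time of the first attempt since the last convergence
    private long totalIterations = 0;  // iterations summed over all attempts since the last convergence
    private int restarts = 0;          // attempts abandoned because a new cluster state arrived

    /** Call whenever a computation starts, including when a new cluster state restarts it. */
    void onComputationStart(long nowMillis) {
        if (pending) {
            restarts++; // the previous attempt never converged
        } else {
            pending = true;
            firstStartMillis = nowMillis;
        }
    }

    /** Call once per iteration of the computation loop. */
    void onIteration() {
        totalIterations++;
    }

    /** Call only when the computation fully converges. */
    void onConverged() {
        pending = false;
        totalIterations = 0;
        restarts = 0;
    }

    /** The thresholds are assumed inputs; the real ones would come from existing settings. */
    boolean shouldReport(long nowMillis, long iterationThreshold, long timeThresholdMillis) {
        return pending
            && (totalIterations >= iterationThreshold || nowMillis - firstStartMillis >= timeThresholdMillis);
    }

    /** Builds a message in the style of the existing one, extended with the restart count. */
    String describe(long index, long nowMillis) {
        return String.format(
            java.util.Locale.ROOT,
            "Desired balance computation for [%d] is still not converged after [%dms], [%d] iterations and [%d] restarts",
            index,
            nowMillis - firstStartMillis,
            totalIterations,
            restarts
        );
    }
}
```

With this shape, two attempts of 60 iterations each would cross a 100-iteration threshold together, even though neither attempt would cross it alone.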

Desired-balance warn threshold logging should accumulate across restarts #100850
