Improve metric query performance

A number of performance issues found in production clusters for metric solutions need to be addressed in order to have competitive query latency in the metric space. This is part of the tsdb effort, whose aim is to make Elasticsearch better at storing and querying metric data. The tasks listed here are improvements that significantly reduce query time, either for many metric query workloads or for specific ones.

Our current observations indicate that the poor performance is caused by the default refresh behaviour. By default, shards go search-idle after 30 seconds of search inactivity. When a search-idle shard is queried, a refresh is performed as part of the search, and only then does search execution continue. This adds a significant amount of latency to the query, especially because the refresh isn't triggered immediately: the search waits until the scheduled refresh kicks in (which often means that nothing happens for 1 second).
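To experiment with the search-idle behaviour described above, the relevant index settings can be overridden per index. The following is a minimal sketch that only builds the settings body for the update index settings API; the helper name `search_idle_settings` and the chosen values are illustrative assumptions, not the fix tracked by this issue.

```python
import json


def search_idle_settings(idle_after="30s", refresh_interval=None):
    """Build a body for the update index settings API (hypothetical helper).

    idle_after maps to `index.search.idle.after` (30s is the default).
    refresh_interval maps to `index.refresh_interval`; it is only included
    when given, so the index default is otherwise left untouched.
    """
    settings = {"index": {"search": {"idle": {"after": idle_after}}}}
    if refresh_interval is not None:
        settings["index"]["refresh_interval"] = refresh_interval
    return settings


# Example: keep shards from going search-idle for 10 minutes.
print(json.dumps(search_idle_settings(idle_after="10m"), indent=2))
```

The resulting JSON would be sent with `PUT /<index>/_settings`. Whether a longer idle timeout or an explicit refresh interval is appropriate depends on the indexing load, so treat the values as a starting point for benchmarking.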

Additionally, we observed that any search with a `percentile` aggregation is slow. Under the hood, the `percentile` aggregation uses the AVL-based t-digest implementation to compute the percentiles. This shows up as a significant hotspot when profiling.
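For reference, a search of the affected kind looks roughly like the sketch below. In the search API the aggregation is named `percentiles`; the field name `cpu.usage`, the aggregation name `usage_percentiles`, and the helper `percentiles_request` are hypothetical and only serve the illustration.

```python
import json


def percentiles_request(field, percents=(50, 95, 99)):
    """Build a search body with a single percentiles aggregation.

    `size: 0` skips returning hits, so only the aggregation cost remains.
    """
    return {
        "size": 0,
        "aggs": {
            "usage_percentiles": {  # hypothetical aggregation name
                "percentiles": {"field": field, "percents": list(percents)}
            }
        },
    }


# Hypothetical metric field; replace with a numeric field from your mapping.
print(json.dumps(percentiles_request("cpu.usage"), indent=2))
```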

- [x] Build a new Rally track that measures performance when shards go search-idle. elastic/rally-tracks#373
- [x] #95544 
- [x] #95541
- [x] Improve the performance of the `percentile` aggregation by switching to the merging-based t-digest implementation. The current AVL-based implementation performs slowly in production with metric data sets of any reasonable size. This work consists of forking the t-digest library (#95903) and then changing the implementation to merging t-digest (#35182).
- [x] Improve `cardinality` aggregation performance on low cardinality fields (#92060). 
- [ ] Better detect when the execution hint `map` or `global_ordinals` should be used.
- [x] elastic/kibana#157837
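
Until the detection mentioned in the task list is improved, the execution hint can be set by hand on a `terms` aggregation to compare both strategies. This is a minimal sketch that only builds the request body; the helper `terms_request`, the aggregation name `by_field`, and the field `host.name` are hypothetical.

```python
import json

ALLOWED_HINTS = ("map", "global_ordinals")


def terms_request(field, execution_hint=None, size=10):
    """Build a search body with a terms aggregation and an optional hint."""
    if execution_hint is not None and execution_hint not in ALLOWED_HINTS:
        raise ValueError(f"unsupported execution_hint: {execution_hint!r}")
    terms = {"field": field, "size": size}
    if execution_hint is not None:
        terms["execution_hint"] = execution_hint
    return {"size": 0, "aggs": {"by_field": {"terms": terms}}}


# Build one body per hint so the two strategies can be benchmarked side by side.
for hint in ALLOWED_HINTS:
    print(json.dumps(terms_request("host.name", execution_hint=hint)))
```

Leaving `execution_hint` unset lets Elasticsearch choose, which is the behaviour the open task aims to improve.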

Improve metric query performance #95776