Skip to content

Commit 3460dc9

Browse files
authored
[DOCS] Percentile aggs are non-deterministic (#50468)
Percentile aggregations are non-deterministic. A percentile aggregation can produce different results even when using the same data. Based on [this discuss post][0], the non-deterministic property stems from processes in Lucene that can affect the order in which docs are provided to the aggregation. This adds a warning stating that the aggregation is non-deterministic and what that means. [0]: https://discuss.elastic.co/t/different-results-for-same-query/111757
1 parent 769650e commit 3460dc9

File tree

1 file changed

+7
-0
lines changed

1 file changed

+7
-0
lines changed

docs/reference/aggregations/metrics/percentile-aggregation.asciidoc

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -251,6 +251,13 @@ for large number of values is that the law of large numbers makes the distributi
251251
values more and more uniform and the t-digest tree can do a better job at summarizing
252252
it. It would not be the case on more skewed distributions.
253253

254+
[WARNING]
255+
====
256+
Percentile aggregations are also
257+
https://en.wikipedia.org/wiki/Nondeterministic_algorithm[non-deterministic].
258+
This means you can get slightly different results using the same data.
259+
====
260+
254261
[[search-aggregations-metrics-percentile-aggregation-compression]]
255262
==== Compression
256263

0 commit comments

Comments
 (0)