More advices around search speed and disk usage. #25252

jpountz · 2017-06-15T13:13:05Z

It adds notes about:

how preference can help optimize cache usage
the fact that too many replicas can hurt search performance due to lower
utilization of the filesystem cache
how index sorting can improve _source compression
how always putting fields in the same order in documents can improve _source
compression

It adds notes about: - how preference can help optimize cache usage - the fact that too many replicas can hurt search performance due to lower utilization of the filesystem cache - how index sorting can improve _source compression - how always putting fields in the same order in documents can improve _source compression

nik9000

Left a few minor things but LGTM.

nik9000 · 2017-06-15T15:06:23Z

docs/reference/how-to/disk-usage.asciidoc

+[float]
+=== Use index sorting to colocate similar documents
+
+Elasticsearch compresses multiple documents at once in order to improve the


Maybe something like "When Elasticsearch stores _source, it compresses multiple documents at once to improve the overall compression ratio...." so we don't get people thinking that doc values and the inverted index bits are stored like this.

nik9000 · 2017-06-15T15:06:57Z

docs/reference/how-to/disk-usage.asciidoc

+Elasticsearch compresses multiple documents at once in order to improve the
+overall compression ratio. For instance it is very common that documents share
+the same field names, and quite common that they share some field values,
+especially on fields that have a low cardinality or a zipfian distribution.


zipfian might deserve a link to wikipedia.

nik9000 · 2017-06-15T15:10:20Z

docs/reference/how-to/disk-usage.asciidoc

+the same field names, and quite common that they share some field values,
+especially on fields that have a low cardinality or a zipfian distribution.
+
+Documents that are compressed together are documents that are colocated in the


I'd twist this around to something like "By default documents are compressed together in the order that they are added to the index. If you enabled index sorting then instead they are compressed in sorted order. Sorting documents with similar structure, fields, and values together should improve the compression ratio." Or something like that. It feels more active that way. I dunno.

nik9000 · 2017-06-15T15:14:44Z

docs/reference/how-to/search-speed.asciidoc

+=== Use `preference` to optimize cache utilization
+
+There are multiple caches that can help with search performance, such as the
+filesystem cache, the <<shard-request-cache,request cache>> or the


Maybe make filesystem cache a link to https://en.wikipedia.org/wiki/Page_cache ?

nik9000 · 2017-06-15T15:15:49Z

docs/reference/how-to/search-speed.asciidoc

+filesystem cache, the <<shard-request-cache,request cache>> or the
+<<query-cache,query cache>>. Yet all these caches are maintained at the node
+level, meaning that if you run the same request twice in a row, have 1
+<<glossary-replica-shard,replica>> or more and use the default routing


I'd s/use the default routing algorithm, which is round-robin,/use round-robin, the default routing algorithm/

martijnvg

LGTM

…y-context * 'master' of github.com:elastic/elasticsearch: (21 commits) [DOCS] Clarify expected availability of HDFS for the HDFS Repository (elastic#25220) Remove some redundant 140 character checkstyle suppressions [Docs] more fix for the parent-join docs [Docs] Fix cross reference for parent-join field More advices around search speed and disk usage. (elastic#25252) Add documentation for the new parent-join field (elastic#25227) [analysis-icu] Allow setting unicodeSetFilter (elastic#20814) Introduce translog size and age based retention policies (elastic#25147) Add needs methods for specific variables to Painless script context factories. (elastic#25267) Improves snapshot logging and snapshoth deletion error handling (elastic#25264) Add unit test for PathHierarchyTokenizerFactory (elastic#24984) Deprecate tribe service Moved more token filters to analysis-common module. [Test] Make sure that SearchAfterSortedDocQueryTests uses a single threaded searcher [DOCS] Defined es-test-dir and plugins-examples-dir in index.asciidoc. (elastic#25232) Test fix - removed superfluous assertion (elastic#25247) [Test] restore BWC for parent-join now that the new mapping format is in 5.x Add a section named "relations" in the ParentJoinFieldMapper (elastic#25248) test: Ported more OldIndexBackwardsCompatibilityIT tests to full cluster restart qa tests. (elastic#25173) fix: Sort Processor does not have proper behavior with targetField (elastic#25237) ...

jpountz added the >docs General docs changes label Jun 15, 2017

nik9000 approved these changes Jun 15, 2017

View reviewed changes

martijnvg approved these changes Jun 15, 2017

View reviewed changes

iter

810edce

jpountz merged commit 8c869e2 into elastic:master Jun 16, 2017

jpountz deleted the docs/preference_search_speed branch June 16, 2017 09:23

jpountz added the v6.0.0 label Jun 16, 2017

clintongormley added v6.0.0-beta1 and removed v6.0.0 labels Jul 25, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

More advices around search speed and disk usage. #25252

More advices around search speed and disk usage. #25252

Uh oh!

jpountz commented Jun 15, 2017

Uh oh!

nik9000 left a comment

Uh oh!

nik9000 Jun 15, 2017

Uh oh!

nik9000 Jun 15, 2017

Uh oh!

nik9000 Jun 15, 2017

Uh oh!

nik9000 Jun 15, 2017

Uh oh!

nik9000 Jun 15, 2017

Uh oh!

nik9000 Jun 15, 2017

Uh oh!

martijnvg left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

More advices around search speed and disk usage. #25252

More advices around search speed and disk usage. #25252

Uh oh!

Conversation

jpountz commented Jun 15, 2017

Uh oh!

nik9000 left a comment

Choose a reason for hiding this comment

Uh oh!

nik9000 Jun 15, 2017

Choose a reason for hiding this comment

Uh oh!

nik9000 Jun 15, 2017

Choose a reason for hiding this comment

Uh oh!

nik9000 Jun 15, 2017

Choose a reason for hiding this comment

Uh oh!

nik9000 Jun 15, 2017

Choose a reason for hiding this comment

Uh oh!

nik9000 Jun 15, 2017

Choose a reason for hiding this comment

Uh oh!

nik9000 Jun 15, 2017

Choose a reason for hiding this comment

Uh oh!

martijnvg left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants