Data tier ergonomics #70618

@jpountz

The formalization of data tiers (#60994) enables us to ship with defaults that are better suited for the expected usage pattern.

Content indices

If we can expect content indices to be more actively searched than indexed, we could make the following changes, which would trade indexing speed for search speed or

  • Number of segments per tier set to 5 (default: 10)?
  • Maximum percentage of deletes set to 20% (default: 33%)?
  • Enable merge-on-refresh (default: disabled)?
  • Refresh interval set to 1s (default: disabled on inactive shards, 1s on active shards)?
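The proposed content-tier values could be sketched as an index settings body, as below. This is only an illustration of the list above, not an agreed-upon change. The setting names `index.merge.policy.segments_per_tier`, `index.merge.policy.deletes_pct_allowed` and `index.refresh_interval` are assumed to be the ones that correspond to the first, second and fourth bullets. Merge-on-refresh is left out because this issue does not name a setting for it. The helper name `content_tier_settings` is hypothetical.

```python
import json


def content_tier_settings() -> dict:
    """Build the overrides proposed above for content indices.

    Assumption: the keys below are the index settings that the bullets
    refer to; the values are the ones proposed in the list.
    """
    return {
        # Fewer segments per tier: more merging at index time, fewer segments to search.
        "index.merge.policy.segments_per_tier": 5,
        # Lower tolerated percentage of deleted documents.
        "index.merge.policy.deletes_pct_allowed": 20,
        # Refresh every second, whether or not the shard is search-idle.
        "index.refresh_interval": "1s",
    }


if __name__ == "__main__":
    # Print a request body that could be sent when creating an index.
    print(json.dumps({"settings": content_tier_settings()}, indent=2))
```

Keeping the overrides in one small function makes it easy to compare them with the current defaults (10 segments per tier, 33% deletes) before deciding which ones to ship.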

Hot indices

Our defaults tend to be geared towards hot data. Is there any setting that could have a more sensible value on hot data?

Warm indices

Warm, cold and frozen indices are expected to be allocated to dense storage, so memory needs to be used wisely.

Cold indices

Frozen indices
