-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Closed
Labels
:Search Relevance/AnalysisHow text is split into tokensHow text is split into tokens>docsGeneral docs changesGeneral docs changesTeam:Search RelevanceMeta label for the Search Relevance team in ElasticsearchMeta label for the Search Relevance team in Elasticsearch
Description
The Minhash Token Filter documentation only describes the interface for the token filter. That is fine for most token filters, but this one is more complicated.
- It should list possible use cases such as an alternative to the "more like this" query.
- It should talk about the recommended number of shingles: 5.
- It should give small but complete examples for 1 and 2.
In the Lucene issue, they discuss Jaccard and cosine similarities. Did that make it into the final patch? If so, should that be exposed as a setting?
ebenoist, bartvanremortele, synhershko, alizx, hsen-dev and 6 more
Metadata
Metadata
Assignees
Labels
:Search Relevance/AnalysisHow text is split into tokensHow text is split into tokens>docsGeneral docs changesGeneral docs changesTeam:Search RelevanceMeta label for the Search Relevance team in ElasticsearchMeta label for the Search Relevance team in Elasticsearch