
@jrodewig
Contributor

Changes:

* Rewrites description and adds Lucene link
* Adds analyze example
* Rewrites parameter documentation
* Updates custom analyzer and filter examples
* Adds anchor to `index.max_shingle_diff` index-level setting
@jrodewig added the `>docs` (General docs changes) and `:Search Relevance/Analysis` (How text is split into tokens) labels on May 21, 2020
@jrodewig requested a review from @mayya-sharipova on May 21, 2020, 13:32
@elasticmachine
Collaborator

Pinging @elastic/es-docs (>docs)

@elasticmachine added the `Team:Docs` label (Meta label for docs team) on May 21, 2020
@elasticmachine
Collaborator

Pinging @elastic/es-search (:Search/Analysis)

@elasticmachine added the `Team:Search` label (Meta label for search team) on May 21, 2020
Contributor

@mayya-sharipova left a comment

@jrodewig Thanks! This looks very good to me.

difference between `max_shingle_size` and `min_shingle_size`.
Add shingles, or word https://en.wikipedia.org/wiki/N-gram[n-grams], to a token
stream by concatenating adjacent tokens. By default, the `shingle` token filter
creates two-word shingles.
Contributor

should we also say "two-word shingles and unigrams"?

Contributor Author

Good catch. I've updated this to "outputs two-word shingles and unigrams".
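
For illustration, the default behavior agreed on in this thread (unigrams plus two-word shingles) can be sketched in Python. This is a simplified model, not Lucene's implementation: the `shingles` function is hypothetical, and its keyword arguments only mirror the filter's `min_shingle_size`, `max_shingle_size`, `output_unigrams`, and `token_separator` parameters.

```python
def shingles(tokens, min_size=2, max_size=2, output_unigrams=True, separator=" "):
    """Hypothetical model of a shingle filter over a list of tokens.

    At each position, emit the unigram (if enabled), then every shingle
    of min_size..max_size adjacent tokens that starts at that position.
    """
    out = []
    for i in range(len(tokens)):
        if output_unigrams:
            out.append(tokens[i])
        for n in range(min_size, max_size + 1):
            # Only emit a shingle if enough tokens remain to fill it.
            if i + n <= len(tokens):
                out.append(separator.join(tokens[i:i + n]))
    return out


print(shingles(["quick", "brown", "fox"]))
# ['quick', 'quick brown', 'brown', 'brown fox', 'fox']
```

Passing `output_unigrams=False` to this sketch leaves only the two-word shingles, which is the distinction the review comment above asks the description to spell out.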

@jrodewig
Contributor Author

Thanks @mayya-sharipova

@jrodewig merged commit 00ab16f into elastic:master on May 21, 2020
@jrodewig deleted the docs__reformat-shingle-tokenfilter branch on May 21, 2020, 17:41
jrodewig added three commits that referenced this pull request on May 21, 2020, each with the same message:
Changes:

* Rewrites description and adds Lucene link
* Adds analyze example
* Rewrites parameter documentation
* Updates custom analyzer and filter examples
* Adds anchor to `index.max_shingle_diff` index-level setting
@jrodewig
Contributor Author

Backport commits

master 00ab16f
7.x a2de43d
7.8 85da75d
7.7 902f0cd
