-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Description
I just discovered that analysis ICU is still at version 54.1 in Elasticsearch 5.0 (same as ES 2.4) as you can see here: https://github.com/elastic/elasticsearch/blob/4b5f87cb7dcea80bca99f863a02e585b8c2c1e17/plugins/analysis-icu/build.gradle
ICU 54 is quite old now and the break iterator, collations and emoji support have been greatly improved since then (two years ago). See http://site.icu-project.org/download.
As updating ICU require a re-indexation, I'm wondering what is the path for power user who wants to benefits from the latest CLDR / Unicode features? I understand that updating Elasticsearch should not require a re-index operation, so I guess I should build my own icu-analysis plugin maybe?
The Lucene ICU components are also not up to date with ICU as they requires the version 56.1 instead of 58.1.
As this plugin depend on a library that is very often updated, maybe it's not the best choice to ship it with Elasticsearch directly? As an external plugin, it could follow the ICU versions without blocking cluster updates for users.
Thanks,
Damien