Update tika version to 1.15 #25003
|
Since this is a community submitted pull request, a Jenkins build has not been kicked off automatically. Can an Elastic organization member please verify the contents of this patch and then kick off a build manually? |
jasontedor
left a comment
I don't think this will pass gradle precommit, let alone gradle check.
There's more that will be needed than this, for example, updating the SHAs. Did you run gradle check?
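For context, the SHAs mentioned here are presumably the checksum files that the build compares against each dependency jar; this thread does not show their location or the Gradle task that regenerates them. As a minimal sketch of what such a checksum is, assuming it is a lowercase hex SHA-1 digest of the jar's bytes, the hypothetical helper below (`Sha1Sketch` is not part of the Elasticsearch build) computes one with the JDK's `MessageDigest`:

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper for illustration only; not part of the Elasticsearch build.
final class Sha1Sketch {

    /** Returns the lowercase hex SHA-1 digest of the given bytes. */
    static String sha1Hex(byte[] data) {
        MessageDigest digest;
        try {
            digest = MessageDigest.getInstance("SHA-1");
        } catch (NoSuchAlgorithmException e) {
            // Every compliant Java platform is required to provide SHA-1.
            throw new IllegalStateException(e);
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : digest.digest(data)) {
            // %02x on a Byte formats it as an unsigned two-digit hex value.
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    public static void main(String[] args) {
        // "abc" stands in for the bytes of a dependency jar.
        System.out.println(sha1Hex("abc".getBytes(StandardCharsets.UTF_8)));
    }
}
```

In practice the input would be the bytes of the upgraded jar (for example read with `java.nio.file.Files.readAllBytes`), and the resulting digest would replace the one recorded for the old version.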
|
You were right, I ran checks, everything looks good.
Should I exclude them as well? |
dadoonet
left a comment
The change looks OK. That said I know 1.15.1 will be released soonish as per https://twitter.com/_tallison/status/880140793553596418. Should we wait a bit? @jasontedor WDYT?
|
@dadoonet I think if this build is green we should get this in as-is. I know how release processes go, we could be waiting some uncertain period. A bird in the hand is worth two in the bush. |
|
test this please |
|
retest this please |
|
@jasontedor, would you like me to merge |
|
Yes, please do. |
|
@jasontedor I think that, since this has had master merged, it's ready to be merged. Is there any other review you want? (You're still marked as requesting changes, which is why I pinged you.) |
|
test this please |
* master: (458 commits)
  Prevent cluster internal `ClusterState.Custom` impls to leak to a client (elastic#26232)
  Add packaging test for systemd runtime directive
  [TEST] Reenable RareClusterStateIt#testDeleteCreateInOneBulk
  Serialize and expose timeout of acknowledged requests in REST layer (elastic#26189)
  (refactor) some opportunities to use diamond operator (elastic#25585)
  [DOCS] Clarified readme for testing a single page
  Settings: Add keystore.seed auto generated secure setting (elastic#26149)
  Update version information (elastic#25226)
  "result" : created -> "result" : "created" (elastic#25446)
  Set RuntimeDirectory (elastic#23526)
  Drop upgrade from full cluster restart tests (elastic#26224)
  Further improve docs for requests_per_second
  Docs disambiguate reindex's requests_per_second (elastic#26185)
  [DOCS] Cleanup link for ec2 discovery (elastic#26222)
  Fix document field equals and hash code test
  Use holder pattern for lazy deprecation loggers
  Settings: Add keystore creation to add commands (elastic#26126)
  Docs: Cleanup docs for ec2 discovery (elastic#26065)
  Fix NPE when `values` is omitted on percentile_ranks agg (elastic#26046)
  Several internal improvements to internal test cluster infra (elastic#26214)
  ...
|
retest this please |
dadoonet
left a comment
I wonder whether this works when Tesseract is installed on the same platform. I read somewhere that Tika runs OCR by default as of 1.15, and I'm pretty sure we need to disable it.
I can't check now as I'm on holiday and AFK. :)
But I'd rather mention it now, in case anyone would like to check before I'm back.
|
@elasticmachine test this please |
|
I ran a manual test with Tesseract installed locally and the current patch passed the integration tests. |
|
@jasontedor WDYT? Should we merge it as is or update this branch to 1.16 first? |
|
We should merge this as-is (assuming CI is green). |
|
retest this please |
* master: (556 commits)
  Fix find remote when building BWC
  Remove colons from task and configuration names
  Add unreleased 5.6.5 version number
  testCreateSplitIndexToN: do not set `routing_partition_size` to >= `number_of_routing_shards`
  Snapshot/Restore: better handle incorrect chunk_size settings in FS repo (elastic#26844)
  Add limits for ngram and shingle settings (elastic#27211) (elastic#27318)
  Correct comment in index shard test
  Roll translog generation on primary promotion
  ObjectParser: Replace IllegalStateException with ParsingException (elastic#27302)
  scripted_metric _agg parameter disappears if params are provided (elastic#27159)
  Update discovery-ec2.asciidoc
  Update shrink's bwc version to 6.1.0 and enabled bwc tests
  Add limits for ngram and shingle settings (elastic#27211)
  Disable bwc tests in preparation of backporting elastic#26931
  TemplateUpgradeService should only run on the master (elastic#27294)
  Die with dignity while merging
  Fix profiling naming issues (elastic#27133)
  Correctly encode warning headers
  Fixed references to Multi Index Syntax (elastic#27283)
  Add an active Elasticsearch WordPress plugin link (elastic#27279)
  ...
|
retest this please |
jasontedor
left a comment
LGTM.
This commit upgrades the Tika dependency to version 1.15. Relates #25003
|
I have integrated this into 5.6, 6.x, and master. This still needs to be merged into 6.0 after GA. |
|
@jasontedor I pushed 91a23de to fix a version number issue in the packaging tests. Please don't forget to backport this commit when backporting your change in 6.0. |
This commit upgrades the Tika dependency to version 1.15. Relates #25003
Close #24969