Always rebuild checkpoint tracker for old indices #46340

dnhatn · 2019-09-04T18:57:08Z

The max_seq_no stored in a Lucene commit of an old index (created before 6.6.2) can be smaller than the seq_no of some documents in that commit (see #38879). Although we fixed this bug in 6.6.2 and 7.0.0, a problematic index commit can still affect newer versions after a rolling upgrade or a full cluster restart. In particular, if a FollowingEngine (or an InternalEngine with MSU enabled) restores from a problematic commit, it can wrongly apply the MSU optimization to documents that already exist. The symptom we see here is a violated assertion in the local checkpoint tracker.

Closes #46311
Relates #38879
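
To illustrate the failure mode described above, here is a minimal toy sketch of rebuilding a checkpoint tracker by scanning the seq_nos actually present in a commit rather than trusting the committed max_seq_no. It is not the Elasticsearch implementation: `CheckpointRebuildSketch`, `ToyTracker`, and `rebuild` are hypothetical names, and the commit is modeled as a plain array of seq_nos.

```java
import java.util.BitSet;

// Toy model only: all names here are hypothetical, not Elasticsearch APIs.
class CheckpointRebuildSketch {

    /** The checkpoint is the highest seq_no such that every seq_no at or below it is processed. */
    static final class ToyTracker {
        private final BitSet processed = new BitSet();
        private long checkpoint;
        private long maxSeqNo;

        ToyTracker(long maxSeqNo, long checkpoint) {
            this.maxSeqNo = maxSeqNo;
            this.checkpoint = checkpoint;
        }

        void markProcessed(long seqNo) {
            if (seqNo <= checkpoint) {
                return; // already covered by the checkpoint
            }
            processed.set(Math.toIntExact(seqNo));
            maxSeqNo = Math.max(maxSeqNo, seqNo);
            // Advance the checkpoint across any contiguous run of processed seq_nos.
            while (processed.get(Math.toIntExact(checkpoint + 1))) {
                checkpoint++;
            }
        }

        long getCheckpoint() { return checkpoint; }

        long getMaxSeqNo() { return maxSeqNo; }
    }

    /**
     * Starts from the (possibly stale) committed values, then marks every seq_no
     * found in the commit, so a too-small committed max_seq_no gets corrected.
     */
    static ToyTracker rebuild(long committedMaxSeqNo, long committedCheckpoint, long[] seqNosInCommit) {
        ToyTracker tracker = new ToyTracker(committedMaxSeqNo, committedCheckpoint);
        for (long seqNo : seqNosInCommit) {
            tracker.markProcessed(seqNo);
        }
        return tracker;
    }

    public static void main(String[] args) {
        // The commit claims max_seq_no = 3 but actually contains documents up to seq_no 5.
        ToyTracker t = rebuild(3, 3, new long[] {0, 1, 2, 3, 4, 5});
        System.out.println("checkpoint=" + t.getCheckpoint() + " maxSeqNo=" + t.getMaxSeqNo());
    }
}
```

In this toy model, skipping the scan would leave the tracker believing that seq_nos 4 and 5 were never processed, which is the kind of inconsistency an assertion on the tracker would later catch.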

elasticmachine · 2019-09-04T18:57:10Z

Pinging @elastic/es-distributed

ywelsch

I've asked for two cosmetic changes, o.w. looking good.

server/src/main/java/org/elasticsearch/common/lucene/Lucene.java

server/src/main/java/org/elasticsearch/index/engine/InternalEngine.java

henningandersen

LGTM.

server/src/main/java/org/elasticsearch/index/engine/InternalEngine.java

dnhatn · 2019-09-05T21:18:29Z

run elasticsearch-ci/2

dnhatn · 2019-09-07T03:10:40Z

@ywelsch and @henningandersen Thanks for reviewing.

Always rebuild checkpoint tracker for old indices

98c865a

dnhatn added the >bug, :Distributed Indexing/Engine, and v6.8.4 labels Sep 4, 2019

dnhatn requested review from henningandersen and ywelsch September 4, 2019 18:57

ywelsch approved these changes Sep 5, 2019

server/src/main/java/org/elasticsearch/common/lucene/Lucene.java (outdated)

server/src/main/java/org/elasticsearch/index/engine/InternalEngine.java

henningandersen approved these changes Sep 5, 2019

server/src/main/java/org/elasticsearch/index/engine/InternalEngine.java

yannick’s feedback

797862b

dnhatn merged commit eae6361 into elastic:6.8 Sep 7, 2019

dnhatn deleted the rebuild_checkpoint_tracker branch September 7, 2019 03:11

dnhatn mentioned this pull request Sep 7, 2019

[CI] checkpoint tracker failure in 6.8 cluster upgrade test #46311

Closed
