Skip to content

Conversation

@dnhatn
Copy link
Member

@dnhatn dnhatn commented Sep 4, 2019

The max_seq_no of Lucene commit of the old indices (before 6.6.2) can be smaller than seq_no of some documents in the commit (see #38879). Although we fixed this bug in 6.6.2 and 7.0.0, a problematic index commit can still affect the newer version after a rolling upgrade or full cluster restart. In particular, if a FollowingEngine (or InternalEngine with MSU enabled) restores from a problematic commit, then it can apply MSU optimization for existing documents. The symptom that we see here is the local checkpoint tracker assertion is violated.

Closes #46311
Relates #38879

@dnhatn dnhatn added >bug :Distributed Indexing/Engine Anything around managing Lucene and the Translog in an open shard. v6.8.4 labels Sep 4, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-distributed

Copy link
Contributor

@ywelsch ywelsch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've asked for two cosmetic changes, o.w. looking good.

Copy link
Contributor

@henningandersen henningandersen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@dnhatn
Copy link
Member Author

dnhatn commented Sep 5, 2019

run elasticsearch-ci/2

@dnhatn
Copy link
Member Author

dnhatn commented Sep 7, 2019

@ywelsch and @henningandersen Thanks for reviewing.

@dnhatn dnhatn merged commit eae6361 into elastic:6.8 Sep 7, 2019
@dnhatn dnhatn deleted the rebuild_checkpoint_tracker branch September 7, 2019 03:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

>bug :Distributed Indexing/Engine Anything around managing Lucene and the Translog in an open shard. v6.8.4

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants