Fix Race in testGetSnapshotsRequest #61694

original-brownbear · 2020-08-31T07:19:54Z

The fact that the data node is already blocked on writing
data files did not guarantee that the cluster state that made
the data node start snapshotting is already applied on master.
This could lead to races where the get snapshots action still
runs based on a state without the snapshot in it, tripping the assertion.
Much safer to handle this by waiting on the non-blocking snapshot create
to return, which guarantees that the CS has been applied on master.

Closes #61541

The fact that the data node is already blocked on writing data files did not guarantee that the cluster state that made the data node start snapshotting is already applied on master. This could lead to races where the get snapshots action still runs based on a state without the snapshot in it, tripping the assertion. Much safer to handle this by waiting on the non-blocking snapshot create to return, which guarantees that the CS has been applied on master. Closes elastic#61541

elasticmachine · 2020-08-31T07:19:56Z

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

original-brownbear · 2020-08-31T07:20:36Z

Credit here goes to @fcofdez who spotted this race in #61541 (comment) !

original-brownbear · 2020-08-31T07:23:35Z

Jenkins run elasticsearch-ci/packaging-sample-windows

fcofdez

LGTM 👍

original-brownbear · 2020-08-31T08:24:40Z

Thanks Francisco!

The fact that the data node is already blocked on writing data files did not guarantee that the cluster state that made the data node start snapshotting is already applied on master. This could lead to races where the get snapshots action still runs based on a state without the snapshot in it, tripping the assertion. Much safer to handle this by waiting on the non-blocking snapshot create to return, which guarantees that the CS has been applied on master. Closes elastic#61541

The fact that the data node is already blocked on writing data files did not guarantee that the cluster state that made the data node start snapshotting is already applied on master. This could lead to races where the get snapshots action still runs based on a state without the snapshot in it, tripping the assertion. Much safer to handle this by waiting on the non-blocking snapshot create to return, which guarantees that the CS has been applied on master. Closes #61541

original-brownbear added >test Issues or PRs that are addressing/adding tests :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs v8.0.0 v7.10.0 v7.9.1 labels Aug 31, 2020

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Aug 31, 2020

fcofdez approved these changes Aug 31, 2020

View reviewed changes

original-brownbear merged commit 7102dd8 into elastic:master Aug 31, 2020

original-brownbear deleted the 61541 branch August 31, 2020 08:24

original-brownbear mentioned this pull request Aug 31, 2020

Fix Race in testGetSnapshotsRequest (#61694) #61700

Merged

original-brownbear mentioned this pull request Aug 31, 2020

Fix Race in testGetSnapshotsRequest (#61694) #61701

Merged

original-brownbear restored the 61541 branch December 6, 2020 19:03

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Race in testGetSnapshotsRequest #61694

Fix Race in testGetSnapshotsRequest #61694

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

elasticmachine commented Aug 31, 2020

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

fcofdez left a comment

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix Race in testGetSnapshotsRequest #61694

Fix Race in testGetSnapshotsRequest #61694

Uh oh!

Conversation

original-brownbear commented Aug 31, 2020

Uh oh!

elasticmachine commented Aug 31, 2020

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

fcofdez left a comment

Choose a reason for hiding this comment

Uh oh!

original-brownbear commented Aug 31, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants