Skip to content

Conversation

@original-brownbear
Copy link
Contributor

@original-brownbear original-brownbear added >non-issue :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs v7.0.0 labels Feb 5, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-distributed

@original-brownbear original-brownbear changed the title Fix Master Failover and DataNode Leave Blocking Snapshot Fix Master Failover and DataNode Leave Blocked Snapshot Feb 5, 2019
@original-brownbear
Copy link
Contributor Author

@ywelsch unfortunately, I haven't been able to get a test in SnapshotResiliencyTests to reproduce this (looks like there's some behavior difference there with the master failover and node removal timing relative to the real thing), but I think this is a valid fix and can be reviewed/merged.
It simply goes back to the behavior from pre #38368 for master failover so I think it's safe and fixes tests.

I'll continue to try and find a more deterministic test for this tomorrow though :)

@original-brownbear
Copy link
Contributor Author

@ywelsch thanks!

@original-brownbear original-brownbear merged commit 34f2cc7 into elastic:master Feb 5, 2019
@original-brownbear original-brownbear deleted the 38447 branch February 5, 2019 22:57
jasontedor added a commit to jasontedor/elasticsearch that referenced this pull request Feb 11, 2019
* master:
  Add an authentication cache for API keys (elastic#38469)
  Fix exit code in certutil packaging test (elastic#38393)
  Enable logs for intermittent test failure (elastic#38426)
  Disable BWC to backport recovering retention leases (elastic#38477)
  Enable bwc tests now that elastic#38443 is backported. (elastic#38462)
  Fix Master Failover and DataNode Leave Blocking Snapshot (elastic#38460)
  Recover retention leases during peer recovery (elastic#38435)
  Set update mappings mater node timeout to 30 min (elastic#38439)
  Assert job is not null in FullClusterRestartIT (elastic#38218)
  Update ilm-api.asciidoc, point to REMOVE policy (elastic#38235) (elastic#38463)
  SQL: Fix esType for DATETIME/DATE and INTERVALS (elastic#38179)
  Handle deprecation header-AbstractUpgradeTestCase (elastic#38396)
  XPack: core/ccr/Security-cli migration to java-time (elastic#38415)
  Disable bwc tests for elastic#38443 (elastic#38456)
  Bubble-up exceptions from scheduler (elastic#38317)
  Re-enable TasksClientDocumentationIT.testCancelTasks (elastic#38234)
  Allow custom authorization with an authorization engine  (elastic#38358)
  CRUDDocumentationIT fix documentation references
  Remove support for internal versioning for concurrency control (elastic#38254)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

:Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs >non-issue v7.0.0-beta1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants