Closed
Labels: :Distributed Indexing/CRUD (a catch-all label for issues around indexing, updating and getting a doc by id; not search), team-discuss
Description
Today we wait up to 30 minutes when trying to block all operations during a primary relocation handoff:
`elasticsearch/server/src/main/java/org/elasticsearch/index/shard/IndexShard.java`, line 636 in 4bd8e7b:
`indexShardOperationPermits.blockOperations(30, TimeUnit.MINUTES, () -> {`
However, this behaves badly if a write task gets stuck, for instance in the case fixed in #36770. The block cannot be put in place because there is an in-flight operation, but while the block is pending it prevents any further operations from taking place until the attempt eventually times out and fails.
I think we should consider reducing this timeout, because it seems preferable to fail the primary relocation (and therefore resume write operations) much sooner than 30 minutes.
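To make the failure mode above concrete, here is a minimal sketch of it. It is not the Elasticsearch implementation: `PermitSketch`, `tryAcquireOperation` and `releaseOperation` are hypothetical names, and the sketch assumes that new operations are simply rejected while a block is pending, where the real class may handle them differently. Only the shape of `blockOperations(timeout, unit, action)` is taken from the line quoted above.

```java
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical, simplified stand-in for the shard's operation permits.
final class PermitSketch {
    private static final int TOTAL_PERMITS = Integer.MAX_VALUE;
    private final Semaphore semaphore = new Semaphore(TOTAL_PERMITS, true);
    private final AtomicInteger pendingBlocks = new AtomicInteger();

    // Assumption: an operation is refused while a block is pending or in place.
    boolean tryAcquireOperation() {
        return pendingBlocks.get() == 0 && semaphore.tryAcquire();
    }

    void releaseOperation() {
        semaphore.release();
    }

    // Waits up to the timeout for every in-flight operation to finish, then runs the action.
    void blockOperations(long timeout, TimeUnit unit, Runnable onBlocked)
            throws InterruptedException, TimeoutException {
        pendingBlocks.incrementAndGet(); // from here on, new operations are refused
        try {
            if (!semaphore.tryAcquire(TOTAL_PERMITS, timeout, unit)) {
                throw new TimeoutException("timed out waiting for in-flight operations");
            }
            try {
                onBlocked.run();
            } finally {
                semaphore.release(TOTAL_PERMITS);
            }
        } finally {
            pendingBlocks.decrementAndGet(); // block finished or failed; operations may resume
        }
    }

    public static void main(String[] args) throws Exception {
        PermitSketch permits = new PermitSketch();
        permits.tryAcquireOperation(); // simulates a stuck write that never releases its permit

        Thread blocker = new Thread(() -> {
            try {
                // A short timeout stands in for the 30 minutes used today.
                permits.blockOperations(200, TimeUnit.MILLISECONDS, () -> {});
            } catch (TimeoutException e) {
                System.out.println("block failed: " + e.getMessage());
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        blocker.start();
        Thread.sleep(50); // give the blocker time to register its pending block

        System.out.println("new operation while block is pending: " + permits.tryAcquireOperation());
        blocker.join();
        System.out.println("new operation after block failed: " + permits.tryAcquireOperation());
    }
}
```

In this sketch the window during which new operations are refused is exactly the timeout passed to `blockOperations`, which is why a shorter timeout lets writes resume sooner when an operation is stuck.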