Closed
Labels
- :Distributed Indexing/Recovery (Anything around constructing a new shard, either from a local or a remote source.)
- >test-failure (Triaged test failures from CI)
- v8.0.0-alpha1
Description
I see three CI test failures on master that seem to share a pattern. I can't reproduce any of them locally.
There is a lot of failure output, including three distinct exceptions. The first is an AssertionError about a timeout:
org.elasticsearch.indices.state.CloseWhileRelocatingShardsIT > testCloseWhileRelocatingShards FAILED
java.lang.AssertionError: timed out waiting for green state
at org.junit.Assert.fail(Assert.java:88)
at org.elasticsearch.test.ESIntegTestCase.ensureColor(ESIntegTestCase.java:887)
at org.elasticsearch.test.ESIntegTestCase.ensureGreen(ESIntegTestCase.java:840)
at org.elasticsearch.test.ESIntegTestCase.ensureGreen(ESIntegTestCase.java:829)
at org.elasticsearch.indices.state.CloseWhileRelocatingShardsIT.testCloseWhileRelocatingShards(CloseWhileRelocatingShardsIT.java:120)
The second is an assertion error about pending shard operations:
java.lang.AssertionError: shard [index-3][0] on node [node_sd1] has pending operations:
--> BulkShardRequest [[index-3][0]] containing [5] requests
at org.elasticsearch.index.shard.IndexShardOperationPermits.acquire(IndexShardOperationPermits.java:233)
at org.elasticsearch.index.shard.IndexShard.acquirePrimaryOperationPermit(IndexShard.java:2689)
at org.elasticsearch.action.support.replication.TransportReplicationAction.acquirePrimaryOperationPermit(TransportReplicationAction.java:835)
at org.elasticsearch.action.support.replication.TransportReplicationAction$AsyncPrimaryAction.doRun(TransportReplicationAction.java:292)
at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37)
at org.elasticsearch.action.support.replication.TransportReplicationAction.handlePrimaryRequest(TransportReplicationAction.java:255)
at org.elasticsearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:63)
at org.elasticsearch.transport.TransportService$7.doRun(TransportService.java:696)
at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:747)
at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.base/java.lang.Thread.run(Thread.java:834)
at org.elasticsearch.test.InternalTestCluster.lambda$assertNoPendingIndexOperations$11(InternalTestCluster.java:1157)
at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:887)
at org.elasticsearch.test.InternalTestCluster.assertNoPendingIndexOperations(InternalTestCluster.java:1148)
at org.elasticsearch.test.InternalTestCluster.beforeIndexDeletion(InternalTestCluster.java:1142)
at org.elasticsearch.test.ESIntegTestCase.beforeIndexDeletion(ESIntegTestCase.java:575)
at org.elasticsearch.test.ESIntegTestCase.afterInternal(ESIntegTestCase.java:545)
at org.elasticsearch.test.ESIntegTestCase.cleanUpCluster(ESIntegTestCase.java:1936)
[…]
Finally, there's a thread leak error:
org.elasticsearch.indices.state.CloseWhileRelocatingShardsIT > classMethod FAILED
com.carrotsearch.randomizedtesting.ThreadLeakError: 3 threads leaked from SUITE scope at org.elasticsearch.indices.state.CloseWhileRelocatingShardsIT:
1) Thread[id=1114, name=Thread-6, state=RUNNABLE, group=TGRP-CloseWhileRelocatingShardsIT]
at java.base/java.util.Spliterators$ArraySpliterator.tryAdvance(Spliterators.java:958)
at java.base/java.util.Spliterators$1Adapter.hasNext(Spliterators.java:681)
at app//org.elasticsearch.test.BackgroundIndexer$1.run(BackgroundIndexer.java:160)
2) Thread[id=1109, name=Thread-5, state=WAITING, group=TGRP-CloseWhileRelocatingShardsIT]
at java.base/jdk.internal.misc.Unsafe.park(Native Method)
at java.base/java.util.concurrent.locks.LockSupport.park(LockSupport.java:194)
at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:885)
at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1039)
at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1345)
at app//org.elasticsearch.common.util.concurrent.BaseFuture$Sync.get(BaseFuture.java:252)
at app//org.elasticsearch.common.util.concurrent.BaseFuture.get(BaseFuture.java:87)
at app//org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.java:56)
at app//org.elasticsearch.action.support.AdapterActionFuture.actionGet(AdapterActionFuture.java:37)
at app//org.elasticsearch.action.ActionRequestBuilder.get(ActionRequestBuilder.java:52)
at app//org.elasticsearch.test.BackgroundIndexer$1.run(BackgroundIndexer.java:159)
3) Thread[id=1090, name=Thread-4, state=WAITING, group=TGRP-CloseWhileRelocatingShardsIT]
at java.base/jdk.internal.misc.Unsafe.park(Native Method)
at java.base/java.util.concurrent.locks.LockSupport.park(LockSupport.java:194)
at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:885)
at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1039)
at java.base/java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1345)
at app//org.elasticsearch.common.util.concurrent.BaseFuture$Sync.get(BaseFuture.java:252)
at app//org.elasticsearch.common.util.concurrent.BaseFuture.get(BaseFuture.java:87)
at app//org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.java:56)
at app//org.elasticsearch.action.support.AdapterActionFuture.actionGet(AdapterActionFuture.java:37)
at app//org.elasticsearch.action.ActionRequestBuilder.get(ActionRequestBuilder.java:52)
at app//org.elasticsearch.test.BackgroundIndexer$1.run(BackgroundIndexer.java:159)
at __randomizedtesting.SeedInfo.seed([6350B3BE3F809D73]:0)
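For readers less familiar with this kind of leak: two of the three leaked threads are WAITING inside an untimed get() called from BackgroundIndexer. The sketch below uses only java.util.concurrent, not Elasticsearch classes, to show how a thread parked on a future that is never completed stays alive until something completes the future. The class name, thread name, and helper methods are hypothetical, and this only illustrates the symptom, not the root cause of this failure.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;

public class LeakedWaiterSketch {

    /** Starts a thread that blocks on the future with no timeout (hypothetical stand-in for an indexer thread). */
    static Thread startWaiter(CompletableFuture<String> response) {
        Thread t = new Thread(() -> {
            try {
                response.get(); // untimed wait: parks until the future completes
            } catch (Exception e) {
                // interrupted or completed exceptionally: just exit
            }
        }, "hypothetical-indexer");
        t.setDaemon(true); // so the sketch itself can never hang the JVM
        t.start();
        return t;
    }

    /** Polls until the thread reaches the expected state or the timeout elapses; returns the last observed state. */
    static Thread.State awaitState(Thread t, Thread.State expected, long timeoutMillis)
            throws InterruptedException {
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        while (t.getState() != expected && System.nanoTime() < deadline) {
            Thread.sleep(10);
        }
        return t.getState();
    }

    public static void main(String[] args) throws Exception {
        CompletableFuture<String> neverAnswered = new CompletableFuture<>();
        Thread waiter = startWaiter(neverAnswered);

        // While nothing completes the future, the thread is still alive and parked.
        System.out.println("state while pending: " + awaitState(waiter, Thread.State.WAITING, 2000));

        // Completing the future releases the waiter and lets the thread finish.
        neverAnswered.complete("ack");
        waiter.join(2000);
        System.out.println("alive after completion: " + waiter.isAlive());
    }
}
```

If the operation behind the future never finishes (for example because it is itself stuck, as the pending BulkShardRequest above might suggest), a waiter like this outlives the test suite and would be reported as leaked.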
There is a lot of other log output in the build logs.
Links to build scans:
- March 13: https://gradle-enterprise.elastic.co/s/tbjuwp756ed54
- March 2: https://gradle-enterprise.elastic.co/s/5dfhpzpdukz32
- February 25: https://gradle-enterprise.elastic.co/s/3ks6xx6vpt4ls
It looks like the last time we addressed a test failure here was #49403.