Skip to content

[CI] NetworkDisruptionIT.testJobRelocation failing #49908

@droberts195

Description

@droberts195

There have been a few recent failures of NetworkDisruptionIT.testJobRelocation:

In all cases the error is:

org.elasticsearch.xpack.ml.integration.NetworkDisruptionIT > testJobRelocation FAILED
    java.lang.AssertionError
        at org.junit.Assert.fail(Assert.java:86)
        at org.junit.Assert.assertTrue(Assert.java:41)
        at org.junit.Assert.assertNotNull(Assert.java:712)
        at org.junit.Assert.assertNotNull(Assert.java:722)
        at org.elasticsearch.xpack.ml.support.BaseMlIntegTestCase.lambda$awaitJobOpenedAndAssigned$6(BaseMlIntegTestCase.java:400)
        at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:891)
        at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:864)
        at org.elasticsearch.xpack.ml.support.BaseMlIntegTestCase.awaitJobOpenedAndAssigned(BaseMlIntegTestCase.java:395)
        at org.elasticsearch.xpack.ml.integration.NetworkDisruptionIT.testJobRelocation(NetworkDisruptionIT.java:70)

    java.util.concurrent.ExecutionException: Failed to execute phase [query], 

        Caused by:
        Failed to execute phase [query], 

            Caused by:
            Failed to execute phase [query], Search rejected due to missing shards [[.ml-state][0]]. Consider using `allow_partial_search_results` setting to bypass this error.

This doesn't reproduce easily. It probably means there's somewhere in the test where we need to wait for the .ml-state index to have a usable replica prior to simulating the network disruption.

Metadata

Metadata

Assignees

No one assigned

    Labels

    :mlMachine learning>test-failureTriaged test failures from CI

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions