Skip to content

[CI] wipeRollupJobs fails to cleanup job and causes multiple test failures  #38877

@tvernum

Description

@tvernum

From https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+7.x+intake/92/consoleFull

The first failure seems to have been actually running the stop-job snippet:

23:26:34   1> [2019-02-14T00:24:33,170][ERROR][o.e.s.DocsClientYamlTestSuiteIT] [test] This failing test was generated by documentation starting at reference/rollup/apis/stop-job.asciidoc:line_54. It may include many snippets. See docs/README.asciidoc for an explanation of test generation.
23:26:34 ERROR   10.4s | DocsClientYamlTestSuiteIT.test {yaml=reference/rollup/apis/stop-job/line_54} <<< FAILURES!
23:26:34    > Throwable #1: org.elasticsearch.client.ResponseException: method [POST], host [http://[::1]:39829], URI [/_rollup/job/sensor/_stop?wait_for_completion=true&timeout=10s], status line [HTTP/1.1 500 Internal Server Error]
23:26:34    > {"error":{"root_cause":[{"type":"timeout_exception","reason":"Timed out after [10s] while waiting for rollup job [sensor] to stop. State was [STOPPING]"}],"type":"timeout_exception","reason":"Timed out after [10s] while waiting for rollup job [sensor] to stop. State was [STOPPING]"},"status":500}
23:26:34    > 	at __randomizedtesting.SeedInfo.seed([4C8232EC98ABC116:C4D60D363657ACEE]:0)
23:26:34    > 	at org.elasticsearch.client.RestClient.convertResponse(RestClient.java:260)
23:26:34    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:238)
23:26:34    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:212)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.wipeRollupJobs(ESRestTestCase.java:597)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.wipeCluster(ESRestTestCase.java:509)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.cleanUpCluster(ESRestTestCase.java:295)
23:26:34    > 	at java.lang.Thread.run(Thread.java:748)
23:26:34   1> [2019-02-14T00:24:33,179][INFO ][o.e.s.DocsClientYamlTestSuiteIT] [test] [yaml=reference/ml/detector-custom-rules/line_146] before test
23:26:34   1> [2019-02-14T00:24:43,241][INFO ][o.e.s.DocsClientY

But this caused other doc snippets to fail as they tried to clean up the cluster

23:26:34 ERROR   10.2s | DocsClientYamlTestSuiteIT.test {yaml=reference/indices/forcemerge/line_47} <<< FAILURES!
23:26:34    > Throwable #1: org.elasticsearch.client.ResponseException: method [POST], host [http://[::1]:39829], URI [/_rollup/job/sensor/_stop?wait_for_completion=true&timeout=10s], status line [HTTP/1.1 500 Internal Server Error]
23:26:34    > {"error":{"root_cause":[{"type":"timeout_exception","reason":"Timed out after [10s] while waiting for rollup job [sensor] to stop. State was [STOPPING]"}],"type":"timeout_exception","reason":"Timed out after [10s] while waiting for rollup job [sensor] to stop. State was [STOPPING]"},"status":500}
23:26:34    > 	at __randomizedtesting.SeedInfo.seed([4C8232EC98ABC116:C4D60D363657ACEE]:0)
23:26:34    > 	at org.elasticsearch.client.RestClient.convertResponse(RestClient.java:260)
23:26:34    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:238)
23:26:34    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:212)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.wipeRollupJobs(ESRestTestCase.java:597)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.wipeCluster(ESRestTestCase.java:509)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.cleanUpCluster(ESRestTestCase.java:295)
23:26:34    > 	at java.lang.Thread.run(Thread.java:748)
23:26:34   1> [2019-02-14T00:25:03,686][INFO ][o.e.s.DocsClientYamlTestSuiteIT] [test] [yaml=reference/how-to/search-speed/line_87] before test
23:26:34   1> [2019-02-14T00:25:13,898][INFO ][o.e.s.DocsClientYamlTestSuiteIT] [test] [yaml=reference/how-to/search-speed/line_87] after test
23:26:34   1> [2019-02-14T00:25:13,898][ERROR][o.e.s.DocsClientYamlTestSuiteIT] [test] This failing test was generated by documentation starting at reference/how-to/search-speed.asciidoc:line_87. It may include many snippets. See docs/README.asciidoc for an explanation of test generation.
23:26:34 ERROR   10.2s | DocsClientYamlTestSuiteIT.test {yaml=reference/how-to/search-speed/line_87} <<< FAILURES!
23:26:34    > Throwable #1: org.elasticsearch.client.ResponseException: method [POST], host [http://[::1]:39829], URI [/_rollup/job/sensor/_stop?wait_for_completion=true&timeout=10s], status line [HTTP/1.1 500 Internal Server Error]
23:26:34    > {"error":{"root_cause":[{"type":"timeout_exception","reason":"Timed out after [10s] while waiting for rollup job [sensor] to stop. State was [STOPPING]"}],"type":"timeout_exception","reason":"Timed out after [10s] while waiting for rollup job [sensor] to stop. State was [STOPPING]"},"status":500}
23:26:34    > 	at __randomizedtesting.SeedInfo.seed([4C8232EC98ABC116:C4D60D363657ACEE]:0)
23:26:34    > 	at org.elasticsearch.client.RestClient.convertResponse(RestClient.java:260)
23:26:34    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:238)
23:26:34    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:212)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.wipeRollupJobs(ESRestTestCase.java:597)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.wipeCluster(ESRestTestCase.java:509)
23:26:34    > 	at org.elasticsearch.test.rest.ESRestTestCase.cleanUpCluster(ESRestTestCase.java:295)
23:26:34    > 	at java.lang.Thread.run(Thread.java:748)
23:26:34   1> [2019-02-14T00:25:13,905][INFO ][o.e.s.DocsClientYamlTestSuiteIT] [test] [yaml=reference/aggregations/bucket/terms-aggregation/line_57] before test

With an end result of these failures

  • DocsClientYamlTestSuiteIT test {yaml=reference/frozen-indices/line_52}
  • DocsClientYamlTestSuiteIT test {yaml=reference/how-to/search-speed/line_87}
  • DocsClientYamlTestSuiteIT test {yaml=reference/indices/forcemerge/line_47}
  • DocsClientYamlTestSuiteIT test {yaml=reference/ml/detector-custom-rules/line_146}
  • DocsClientYamlTestSuiteIT test {yaml=reference/rollup/apis/stop-job/line_54}

It (unsurprisingly) doesn't reproduce for me.
job-92-consoleLog.txt.gz

Metadata

Metadata

Assignees

Labels

:StorageEngine/RollupTurn fine-grained time-based data into coarser-grained data>test-failureTriaged test failures from CI

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions