Skip to content

[CI] ML job snapshot upgrade IT can leave task behind #65364

Closed
@hendrikmuhs

Description

@hendrikmuhs

The snapshot upgrade API introduced in #64665 can leave tasks behind that cause unrelated tests to fail during teardown. Note: The same issue can happen in different flavors, with different (but similar) stack traces, different testcases. The root cause is always a dangling persistent task.

Build scan:https://gradle-enterprise.elastic.co/s/w2rrhghfdrz2g

Repro line: ./gradlew ':client:rest-high-level:integTest' --tests "org.elasticsearch.client.TransformIT.testGetMissingTransform" -Dtests.seed=5A311E3B23B621C1 -Dtests.security.manager=true -Dtests.locale=hi-IN -Dtests.timezone=Cuba -Druntime.java=8

Reproduces locally?: no

Applicable branches: 7.x, master

Failure history:

Failure excerpt:

org.elasticsearch.client.TransformIT > testGetAllAndPageTransforms FAILED
12:26:30     java.lang.AssertionError: 1 active tasks found:
12:26:30     xpack/ml/job/snapshot/upgrade[c] UMEUON07QrON2VzuzOvTzw:18696 cluster:31                   persistent 1606129934335 11:12:14 14.2m       127.0.0.1 integTest-0 job-snapshot-upgrade-test-upgrade-job-model-snapshot-1541587919
12:26:30      expected:<0> but was:<1>
12:26:30         at __randomizedtesting.SeedInfo.seed([5A311E3B23B621C1:3633578F9FE53698]:0)
12:26:30         at org.junit.Assert.fail(Assert.java:88)
12:26:30         at org.junit.Assert.failNotEquals(Assert.java:834)
12:26:30         at org.junit.Assert.assertEquals(Assert.java:645)
12:26:30         at org.elasticsearch.test.rest.ESRestTestCase.lambda$waitForPendingTasks$3(ESRestTestCase.java:418)
12:26:30         at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:1011)
12:26:30         at org.elasticsearch.test.rest.ESRestTestCase.waitForPendingTasks(ESRestTestCase.java:394)
12:26:30         at org.elasticsearch.test.rest.ESRestTestCase.waitForPendingTasks(ESRestTestCase.java:381)
12:26:30         at org.elasticsearch.client.TransformIT.cleanUpTransformsAndLogAudits(TransformIT.java:166)

Metadata

Metadata

Assignees

Labels

:mlMachine learning>test-failureTriaged test failures from CI

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions