Skip to content

[CI] AssertionError in OsProbe.readProcSelfCgroup #77833

Closed
@ywangd

Description

@ywangd

Multiple tests failed with process was found dead while waiting for ports files. The underlying issue is shown in the test cluster log file: The node crashed when starting up because it didn't read any valid content from the /proc/self/cgroup file. I wonder whether there is a subtle racing condition somwhere setting up the test.

Build scan:
https://gradle-enterprise.elastic.co/s/lfrzuigm2obi4
https://gradle-enterprise.elastic.co/s/wruq4uitjwndq

Repro line:
N/A

Reproduces locally?:
Didn't try

Applicable branches:
7.x

Failure history:
N/A

Failure excerpt:

[2021-09-16T01:10:58,208][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [yamlRestTest-0] fatal error in thread [main], exiting
java.lang.AssertionError: null
    at org.elasticsearch.monitor.os.OsProbe.readProcSelfCgroup(OsProbe.java:298) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.monitor.os.OsProbe.areCgroupStatsAvailable(OsProbe.java:579) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.monitor.os.OsProbe.getCgroup(OsProbe.java:637) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.monitor.os.OsProbe.getCgroup(OsProbe.java:857) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.monitor.os.OsProbe.osStats(OsProbe.java:864) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.monitor.os.OsService.<init>(OsService.java:39) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.monitor.MonitorService.<init>(MonitorService.java:33) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.node.Node.<init>(Node.java:509) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.node.Node.<init>(Node.java:288) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Bootstrap$5.<init>(Bootstrap.java:219) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Bootstrap.setup(Bootstrap.java:219) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Bootstrap.init(Bootstrap.java:399) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Elasticsearch.init(Elasticsearch.java:167) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Elasticsearch.execute(Elasticsearch.java:158) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.cli.EnvironmentAwareCommand.execute(EnvironmentAwareCommand.java:75) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.cli.Command.mainWithoutErrorHandling(Command.java:114) ~[elasticsearch-cli-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.cli.Command.main(Command.java:79) ~[elasticsearch-cli-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Elasticsearch.main(Elasticsearch.java:123) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
    at org.elasticsearch.bootstrap.Elasticsearch.main(Elasticsearch.java:81) ~[elasticsearch-7.16.0-SNAPSHOT.jar:7.16.0-SNAPSHOT]
[2021-09-16T01:11:29.937733503Z] [BUILD] Stopping node

Metadata

Metadata

Assignees

Labels

:Core/Infra/CoreCore issues without another label>test-failureTriaged test failures from CITeam:Core/InfraMeta label for core/infra team

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions