Skip to content

Systemd startup timeouts after upgrade to 7.4 #49593

Closed
@PhaedrusTheGreek

Description

@PhaedrusTheGreek

It has been observed/reported in a few separate environments where after an upgrade from Elasticsearch 7.3 to 7.4, systemd will kill the elasticsearch process before it finishes starting up.

The end of the log looks like this:

[2019-11-25T07:16:46,095][DEBUG][o.e.a.ActionModule ] [node1] Using REST wrapper from plugin org.elasticsearch.xpack.security.Security
[2019-11-25T07:17:22,306][INFO ][o.e.x.m.p.NativeController] [node1] Native controller process has stopped - no new native processes can be started

Trace inspection of the log reveals that the ES node is busy upgrading.

Systemd requires that processes signal after successful startup with sd_notify(READY) before TimeoutStartSec, which in ES doesn't happen until metadata upgrades are complete. It seems that we need to update SystemdPlugin to support EXTEND_TIMEOUT_USEC in the event of any expected startup delays such as upgrade.

Metadata

Metadata

Assignees

Labels

:Delivery/PackagingRPM and deb packaging, tar and zip archives, shell and batch scriptsTeam:DeliveryMeta label for Delivery team

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions