Scheduler plugin to dump cluster state on close #5660

Closed
@gjoseph92

Description

We've run into issues where the scheduler shuts itself down cleanly but unexpectedly after running for a very long time. Having a dump of cluster state would help to debug this.

After #5659 is implemented, write a SchedulerPlugin with a close hook that dumps cluster state. The filename where the state is written can be passed into the plugin instance.

If the cluster state dump fails, or writing to the destination fails, this should not affect the shutdown process—just log the problem and move on.
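
A minimal sketch of the shape this could take, not a definitive implementation. To stay self-contained it does not import distributed: the class name `DumpStateOnClose` and the `get_state` callable are hypothetical stand-ins (the real state getter would be whatever #5659 ends up providing), and in practice the class would subclass `distributed.diagnostics.plugin.SchedulerPlugin`. JSON is used here only for illustration; the issue does not specify a format.

```python
import asyncio
import json
import logging
from typing import Any, Callable

logger = logging.getLogger(__name__)


class DumpStateOnClose:
    """Hypothetical plugin: write cluster state to a file when the scheduler closes.

    Assumption: in distributed this would subclass SchedulerPlugin, and
    `get_state` would be replaced by the state-dump API from #5659.
    """

    def __init__(self, path: str, get_state: Callable[[], Any]):
        self.path = path            # destination filename, passed in by the user
        self.get_state = get_state  # hypothetical callable returning the state

    async def close(self) -> None:
        # Mirrors the async `close` hook of SchedulerPlugin.
        try:
            state = self.get_state()
            with open(self.path, "w") as f:
                # default=str keeps the dump from failing on non-JSON types
                json.dump(state, f, default=str)
        except Exception:
            # Never let a failed dump interfere with shutdown: log and move on.
            logger.exception("Failed to dump cluster state to %s", self.path)


if __name__ == "__main__":
    plugin = DumpStateOnClose("cluster-state.json", lambda: {"status": "closing"})
    asyncio.run(plugin.close())
```

With a real scheduler, an instance would presumably be registered through `Scheduler.add_plugin`. The broad `except Exception` is deliberate: both the state collection and the file write sit inside it, so either failure is logged rather than propagated into the shutdown path.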

Labels

diagnostics, good second issue (Clearly described, educational, but less trivial than "good first issue".)
