Ensure errors are being communicated to alerting services correctly #3609

Open
@tsachiherman

**What?**

Ensure that fatal/error/panic log entries from AvalancheGo and coreth trigger our log-based alerting.

**Why?**

One of the index pods in the data-dev account produced errors that were believed to be related to a corrupted backup process for leveldb.

While the cause of the issue is not known, detecting it earlier would have improved our reaction and handling time.

**Goal**

Make the required changes in AvalancheGo, coreth, and other components, and verify that fatal/error/panic log entries correctly trigger our log-based alerting.
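
One way to check that error-and-above entries are visible to alerting is to count them at the logger itself. The sketch below is only an illustration: it uses Go's standard `log/slog` package rather than the loggers these components actually use, and `alertCounter`, `levelFatal`, and `demo` are hypothetical names invented for this example. It wraps a handler and counts every record at or above a threshold level, which could then be exposed as a metric or compared against what the alerting service received.

```go
package main

import (
	"context"
	"fmt"
	"log/slog"
	"os"
	"sync/atomic"
)

// levelFatal is a hypothetical custom level; slog has no built-in fatal level.
const levelFatal = slog.LevelError + 4

// alertCounter is a hypothetical slog.Handler wrapper. It counts records at or
// above threshold, then forwards every record to the wrapped handler.
type alertCounter struct {
	next      slog.Handler
	threshold slog.Level
	count     *atomic.Int64 // shared by handlers derived via WithAttrs/WithGroup
}

func newAlertCounter(next slog.Handler, threshold slog.Level) *alertCounter {
	return &alertCounter{next: next, threshold: threshold, count: new(atomic.Int64)}
}

func (h *alertCounter) Enabled(ctx context.Context, l slog.Level) bool {
	return h.next.Enabled(ctx, l)
}

func (h *alertCounter) Handle(ctx context.Context, r slog.Record) error {
	if r.Level >= h.threshold {
		h.count.Add(1)
	}
	return h.next.Handle(ctx, r)
}

func (h *alertCounter) WithAttrs(attrs []slog.Attr) slog.Handler {
	return &alertCounter{next: h.next.WithAttrs(attrs), threshold: h.threshold, count: h.count}
}

func (h *alertCounter) WithGroup(name string) slog.Handler {
	return &alertCounter{next: h.next.WithGroup(name), threshold: h.threshold, count: h.count}
}

// Alertable reports how many records at or above the threshold were seen.
func (h *alertCounter) Alertable() int64 { return h.count.Load() }

// demo logs one info, one error, and one fatal-level record to stderr and
// returns the number of records that should have triggered an alert.
func demo() int64 {
	h := newAlertCounter(slog.NewJSONHandler(os.Stderr, nil), slog.LevelError)
	logger := slog.New(h)
	logger.Info("node started")
	logger.Error("database read failed", "db", "leveldb")
	logger.Log(context.Background(), levelFatal, "unrecoverable state")
	return h.Alertable()
}

func main() {
	fmt.Println("alertable records:", demo()) // prints "alertable records: 2"
}
```

Counting in the handler keeps the check independent of the log format, so a mismatch between this count and the alerts actually fired would point at the shipping or matching side rather than at the application.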

Status

Backlog 🧊
