Skip to content

Static alert channel length of 200 is a bottleneck in Alertmanager #4392

Open
@siavashs

Description

@siavashs

Alertmanager memory provider has a static channel length of 200:

const alertChannelLength = 200

This value has been there with no description for almost a decade since introduction in cc5aab6

It seems like a bottleneck in Alertmanager, since a slow alert pipeline can result in API timeouts on POST /api/v2/alerts as the pipeline struggles to consume from the channel.
Root cause of a slow pipeline can be from any of:

  • high volume of alerts
  • high volume of suppressed alerts (inhibition or silences), regex matching can have significant affect here
  • slow receivers

With #4364 the channel length becomes configurable, allowing bigger alert spikes to be queued for processing by Alertmanager.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions