Skip to content

Burrow unreadiness when metrics sidecar gets OOMKilled #255

Open
@solsson

Description

@solsson

I'm logging this issue because there shouldn't be a relation between burrow and the JMX exporter.

To reproduce:

  • Run Kafka with the metrics container, grant it only enough memory to start and run.
  • Run Burrow.
  • Hit the metrics endpoint on one broker so that metrics gets oomkilled.
  • The broker will be 1/2 ready.
  • Burrow typically shows unreadiness.

It's noteworthy that Burrow is configured to access brokers through headless service broker name resolution. That differs from the typical bootstrap process that kafka clients will do. However bootstrap might also be affected, in particular if all metrics pods get oomkilled at the same time. I was unaware until I read the librdkafka 1.0.0 release notes that bootstrap is a persistent connection.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions