Skip to content

Missing request timeout config for ruler_storage #4835

Closed
@KaiserNicols

Description

@KaiserNicols

Describe the bug
Observing timeouts when the Ruler makes a request to Azure storage (/api/v1/rules) to fetch the alert rules. We have roughly 60k rules in storage for a particular tenant, so could be a very large payload if everything comes in a single response.

I suspect the default request timeout is 30s, but I don't see a 'request_timeout' field for Azure in the ruler_storage config.

#4581 added some http configs for Azure storage, but not for a request timeout. I no longer see this in the Cortex docs, but previously there was ruler.storage.azure.request_timeout. This setting is still visible via the /config api.

I get this timeout in Postman as well.

To Reproduce
Run Cortex

  • Distributed mode on AKS, rendered with Helm
  • 1.13.0

Connect the ruler to an azure storage (via the ruler_storage config).
Upload a large amount of rules to storage
Try to have ruler fetch the rules (I believe this is done via /api/v1/rules)

Expected behavior
Ruler should fetch the alerts, and not get timed out.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions