Querier queries too much data when the head block may have enough data

**Describe the bug**
When executing a long backwards query (one whose time range exceeds `query_ingesters_within`), the querier can fetch too much data from both the ingesters and the store, particularly when the ingesters' head blocks contain a large amount of unflushed logs that could satisfy the request on their own.

**To Reproduce**
- have a simple, non-filtering query like `{cluster="us-central1"} | json | line_format "{{.msg}}"`
- have a large number of unflushed lines in the head blocks (~7.5 million in our case, across all 130+ ingesters)
- execute the query over a 48h time range
- ingesters OOM

**Expected behavior**
The querier should query the ingesters for their unflushed data first and, when that data is enough to satisfy the request, should not examine chunks from the store unnecessarily.

**Screenshots, Promtail config, or terminal output**
![image](https://user-images.githubusercontent.com/373762/174773726-17da308a-f1fd-428a-8514-5769ec8463d3.png)
Grafana Cloud trace ID: `410e61362f6ac7c2`

Issue #6441, opened by dannykopping on Jun 21, 2022
