many duplicate `_get_state_groups_from_groups` queries leading to OOMs

We had a federation sender instance which died. On restart, it rapidly consumed all available ram and OOMed again.

Inspection from postgres side shows it is doing many duplicate queries of the form

```sql
WITH RECURSIVE state(state_group) AS ( VALUES(3405820::bigint) UNION ALL SELECT prev_state_group FROM state_group_edges e, state s WHERE s.state_group = e.state_group ) SELECT DISTINCT ON (type, state_key) type, state_key, event_id FROM state_groups_state WHERE state_group IN ( SELECT state_group FROM state ) ORDER BY type, state_key, state_group DESC
```

This query is `_get_state_groups_from_groups`, which is called from `_get_state_for_groups`. Although the latter has a cache, if many threads hit it at the same time, all will find the cache empty and go on to hit the database. I think we need a zero-timeout `ResponseCache` on `_get_state_groups_from_groups`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

many duplicate `_get_state_groups_from_groups` queries leading to OOMs #10301

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

many duplicate _get_state_groups_from_groups queries leading to OOMs #10301

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

many duplicate `_get_state_groups_from_groups` queries leading to OOMs #10301