This repository has been archived by the owner on Apr 26, 2024. It is now read-only.
Full-text-search in Synapse 1.71.0rc1 may be prohibitively expensive (DB IO) #14354
Labels
A-Database
DB stuff like queries, migrations, new/remove columns, indexes, unexpected entries in the db
A-Message-Search
Searching messages
A-Performance
Performance, both client-facing and admin-facing
O-Occasional
Affects or can be seen by some users regularly or most users rarely
S-Major
Major functionality / product severely impaired, no satisfactory workaround.
T-Defect
Bugs, crashes, hangs, security vulnerabilities, or other reported issues.
From Matrix.org's monitoring:
In the "Toast table blocks read from disk/buffer" and "Toast index block read from buffer" graphs, there are intermittent blue spikes corresponding to the
event_search
table.Correlated: peaks in "total txn time" due to
search_rooms
And federation send PDU lag:
and event send time:
(Above graphs only showing the main process).
The event persister also saw pain at similar times:
It's a little tricky to interpret these, because the m.org database was updated this morning (2nd Nov 9.30 UTC) and there was some expensive background processing by postgres afterwards. (All times UTC in the graphs.)
Our suspicion is that the changes in #11635 are to blame. I reverted it, #13410 and #14311 on the hotfixes branch (37307a5) and deployed to matrix.org. We haven't seen the event persisters flare up since then... but it's not completely clear that the changed mentioned were the cause.
I think we should (regrettably) back out the changes on the release branch too, before a final 1.71.0 release.
The text was updated successfully, but these errors were encountered: