This repository has been archived by the owner on Apr 26, 2024. It is now read-only.

Add compression to Synapse in-memory data caches to decrease total RAM usage #8990

Closed
MurzNN opened this issue Dec 25, 2020 · 5 comments

Comments

@MurzNN

MurzNN commented Dec 25, 2020

Description:

Synapse's in-memory caches become very large on many setups (after manually tuning cache sizes to make Synapse more responsive) and eat a lot of RAM. But the cached information has a very good compression ratio (it is mostly ASCII text), so implementing some level of compression (even the fastest one) should significantly decrease Synapse's overall memory usage. What do you think about this idea?

@clokep
Member

clokep commented Dec 28, 2020

But the cached information has a very good compression ratio (it is mostly ASCII text)

I'm not sure this is really true -- I think most of the caches in Synapse cache Python objects, not just raw strings. Which caches were you looking at in particular to come to this conclusion?

@MurzNN
Author

MurzNN commented Jan 6, 2021

Python objects can be compressed using this trick: https://stackoverflow.com/a/19500651. As I understand it, most of the caches work like key-value stores, so we could simply store the values as compressed strings and decompress them on access. This would increase CPU load a bit, but decrease RAM usage.
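
For illustration, here is a minimal sketch of that pickle-plus-zlib approach on a plain dictionary-backed cache. The `CompressedCache` class and its methods are hypothetical stand-ins for this example, not Synapse's actual cache API:

```python
import pickle
import zlib


class CompressedCache:
    """Hypothetical key-value cache that stores values as compressed pickles."""

    def __init__(self):
        self._store = {}  # key -> compressed pickled bytes

    def set(self, key, value):
        # Serialize the Python object, then compress with the fastest zlib level (1).
        data = pickle.dumps(value, protocol=pickle.HIGHEST_PROTOCOL)
        self._store[key] = zlib.compress(data, 1)

    def get(self, key, default=None):
        blob = self._store.get(key)
        if blob is None:
            return default
        # Decompress and deserialize on every access: trades CPU time for RAM.
        return pickle.loads(zlib.decompress(blob))


# Example usage with an invented event-like value:
cache = CompressedCache()
cache.set("some_event_id", {"type": "m.room.message", "content": {"body": "hello " * 100}})
assert cache.get("some_event_id")["content"]["body"].startswith("hello")
```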

The main candidates for compression are *getEvent*, _get_joined_profile_from_event_id and _event_auth_cache (to reduce cache misses on my homeserver I have to increase them to 20-40 times their default sizes), but I can't measure the exact size of each cache in bytes; there is an issue about this: #8811 (comment)

Alternatively, we could compress only the large text data inside objects (e.g. the text body of a message), but that makes the compression task more complex.
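
As a rough illustration of that alternative, here is a sketch that compresses only the body field of an event-shaped dict; the threshold constant and function name are made up for this example and are not Synapse code:

```python
import zlib

# Hypothetical cutoff: very small bodies are not worth the compression overhead.
BODY_COMPRESS_THRESHOLD = 1024


def compress_large_body(event_dict):
    """Return a copy of an event-shaped dict with a large text body stored as compressed bytes."""
    body = event_dict.get("content", {}).get("body")
    if not isinstance(body, str) or len(body) < BODY_COMPRESS_THRESHOLD:
        return event_dict
    result = dict(event_dict)
    result["content"] = dict(event_dict["content"])
    # Whoever reads this entry back out of the cache must decompress and decode it again.
    result["content"]["body"] = zlib.compress(body.encode("utf-8"), 1)
    return result
```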

@ptman
Contributor

ptman commented Jan 7, 2021

I think if someone could show that a compressed cache improves performance, it could be considered. But compressing and decompressing take time, and caching is supposed to save time. Of course, (de)compression can be much faster than I/O, but numbers will tell.

@MurzNN
Author

MurzNN commented Jan 7, 2021

Yes, it would be good to measure before implementing!

The average compression ratio for text data is usually better than 4:1, so with compression we could keep about 4 times as many items in the same amount of RAM, which could significantly decrease the rate of SQL queries.
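
As a starting point for getting those numbers, a micro-benchmark sketch like the one below could measure the ratio and the CPU cost. The sample payload here is invented, so real measurements would have to use objects taken from a live cache:

```python
import pickle
import time
import zlib

# Made-up event-like payload, just to exercise the machinery.
sample = {
    "type": "m.room.message",
    "sender": "@alice:example.org",
    "room_id": "!abcdefghijklmnop:example.org",
    "content": {"msgtype": "m.text", "body": "lorem ipsum dolor sit amet " * 40},
}

raw = pickle.dumps(sample, protocol=pickle.HIGHEST_PROTOCOL)

start = time.perf_counter()
compressed = zlib.compress(raw, 1)  # fastest zlib level
compress_time = time.perf_counter() - start

start = time.perf_counter()
zlib.decompress(compressed)
decompress_time = time.perf_counter() - start

print(f"raw: {len(raw)} bytes, compressed: {len(compressed)} bytes "
      f"(ratio {len(raw) / len(compressed):.1f}:1)")
print(f"compress: {compress_time * 1e6:.0f} µs, decompress: {decompress_time * 1e6:.0f} µs")
```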

For example, I manage the public homeserver ru-matrix.org, which has a 16 GB RAM limit. Increasing Synapse's cache sizes decreases cache misses from 1000+ per second to ~50 per second, but then Synapse starts swapping because RAM fills up, and everything becomes slow again :(

@clokep
Member

clokep commented Jan 11, 2021

Note that we do some interning of the strings that get cached (see uses of intern_string and intern_dict), so there's some effort to re-use memory which could go away with compression.
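
To illustrate that interaction, here is a small sketch using Python's built-in sys.intern (rather than Synapse's intern_string helper) showing how per-entry compression loses the cross-entry sharing that interning provides:

```python
import pickle
import sys
import zlib

# Two cache entries containing the same user ID string.
user_id = "@alice:example.org"
a = {"sender": sys.intern(user_id)}
b = {"sender": sys.intern(user_id)}

# With interning, both entries reference one shared string object in memory.
assert a["sender"] is b["sender"]

# Once each entry is pickled and compressed separately, that sharing is gone:
# each compressed blob carries its own copy of the bytes.
blob_a = zlib.compress(pickle.dumps(a))
blob_b = zlib.compress(pickle.dumps(b))
assert blob_a == blob_b          # equal contents...
assert blob_a is not blob_b      # ...but separate allocations, no cross-entry sharing
```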

Additionally, pickling should generally not be done with user-provided data, so we would need to add some safeguards there. Given the additional complexity this would add, I don't think this would be worthwhile unless numbers were provided showing a dramatic reduction in RAM usage.

I'm going to close this for now, but please experiment and let us know if this seems worthwhile!

@clokep clokep closed this as completed Jan 11, 2021