Remove the need for `worker_name` to simplify scaling #8084

ulope · 2020-08-13T13:59:52Z

Description:

The recent-ish changes to workers now recommend setting a unique `worker_name` per worker process. This makes scaling the number of workers quite a bit more complex than before, since every instance now needs a tailor-made config file.

This is especially annoying when using things like docker-compose, swarm or kubernetes where spinning up multiple (identical) instances of a service is a built-in feature.

I have no concrete proposal on how to solve the problem(s) the worker name solves though. (AFAICS they're only really necessary for reverse mapping for federation senders and stream writers?)


ananace · 2020-08-14T19:55:58Z

Coming from the Kubernetes angle, there's a common method used there to solve this exact issue: the internal cluster DNS generates multi-value records that let you look up all the active pods behind a given service through A and PTR lookups.

Though considering Redis can be used for replication now, perhaps that could be a better solution - albeit one that will make Redis a much harder dependency for Synapse with workers.
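To make the DNS idea concrete, here is a minimal sketch of how a process could list its peers by resolving a service name that returns one A record per pod (a Kubernetes "headless" Service behaves this way). The function name `discover_peers` and the service hostname in the usage note are assumptions for illustration only; nothing like this exists in Synapse as far as this thread shows.

```python
import socket
from typing import Callable, List


def discover_peers(
    service_host: str,
    resolver: Callable = socket.getaddrinfo,
) -> List[str]:
    """Return the sorted, de-duplicated IP addresses behind `service_host`.

    `resolver` is injectable so the lookup can be faked in tests; by
    default it is the standard library's `socket.getaddrinfo`.
    """
    # Restricting to TCP avoids getting one result per socket type.
    infos = resolver(service_host, None, proto=socket.IPPROTO_TCP)
    # Each entry is (family, type, proto, canonname, sockaddr) and
    # sockaddr[0] is the IP address for both IPv4 and IPv6.
    return sorted({info[4][0] for info in infos})
```

A call such as `discover_peers("synapse-workers.default.svc.cluster.local")` (hypothetical service name) would then return the pod IPs currently registered in DNS. Note that this only yields addresses, not worker roles, so it would not by itself replace a name-to-role mapping.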

ghost · 2021-03-23T14:34:53Z

A possible solution for Kubernetes would be to use StatefulSets + Environment Variables

Then we need to change this line in the source code
https://github.com/matrix-org/synapse/blob/develop/synapse/config/workers.py#L124

~~`self.worker_name = config.get("worker_name", self.worker_app)`~~

`self.worker_name = environ.get("HOSTNAME", "i_have_no_env")`

P.S. This is only meant as a temporary workaround for the current situation.
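A slightly more defensive variant of that idea might look like the sketch below: an explicit `worker_name` in the config still wins, `HOSTNAME` is used only when it is missing, and the worker app name remains the final fallback. The helper `resolve_worker_name` and its parameters are hypothetical and are not part of Synapse's config code.

```python
import os
from typing import Mapping, Optional


def resolve_worker_name(
    config: Mapping[str, str],
    worker_app: str,
    environ: Optional[Mapping[str, str]] = None,
) -> str:
    """Pick a worker name from config, then HOSTNAME, then the app name."""
    if environ is None:
        environ = os.environ
    # 1. An explicit `worker_name` in the config file takes priority,
    #    so existing deployments keep working unchanged.
    name = config.get("worker_name")
    if name:
        return name
    # 2. Container runtimes typically set HOSTNAME to the container or
    #    pod name, which is unique per running instance.
    hostname = environ.get("HOSTNAME")
    if hostname:
        return hostname
    # 3. Fall back to the worker app name, as in the original line.
    return worker_app
```

With a plain Deployment the hostname changes on every restart, so this only helps for workers that nothing else needs to refer to by a fixed name.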

ananace · 2021-03-23T17:39:35Z

The issue with using a StatefulSet to get a stable name is that they're designed for running applications where the runtime state is extremely important - things like databases.
Using them for stateless applications like the workers would mean that the cluster can't schedule them as easily, and will in fact make them much more fragile as the cluster will refuse to evict or replace them when necessary - as that'd cause them to lose their "state".

I already see far too many stateless things that start up as STS simply to get stable names.

ulope · 2021-03-23T18:16:01Z

This also will not help with other solutions like compose, swarm etc.

ghost · 2021-03-23T19:09:57Z

> AFAICS they're only really necessary for reverse mapping for federation senders and stream writers?

Could someone make the documentation more detailed to clarify this point?

ananace · 2021-03-24T12:16:51Z

I still feel - even more strongly now - that using Redis for this is the right way to go, since it offers a good channel for querying all running workers.
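As a rough illustration of what a Redis-backed registry could look like, the sketch below has each worker generate its own name at startup and periodically write a key with a TTL, so that any process can list the workers that are currently alive. Everything here is an assumption made for the example: the `WorkerRegistry` class, the `synapse:workers:` key prefix and the TTL value are invented, and this is not how Synapse uses Redis today. The `client` only needs `set(name, value, ex=...)` and `scan_iter(match=...)`, which the `redis.Redis` client from redis-py provides.

```python
import uuid
from typing import List


class WorkerRegistry:
    """Self-registering worker list built on expiring keys."""

    PREFIX = "synapse:workers:"  # hypothetical key namespace

    def __init__(self, client, worker_app: str, ttl_seconds: int = 30):
        self._client = client
        self._ttl = ttl_seconds
        # A random suffix stands in for a hand-written worker_name.
        self.name = f"{worker_app}-{uuid.uuid4().hex[:8]}"

    def heartbeat(self) -> None:
        """Refresh this worker's key; call more often than the TTL."""
        self._client.set(self.PREFIX + self.name, "alive", ex=self._ttl)

    def live_workers(self) -> List[str]:
        """Return the names of all workers whose keys have not expired."""
        names = []
        for key in self._client.scan_iter(match=self.PREFIX + "*"):
            # redis-py returns bytes unless decode_responses=True is set.
            if isinstance(key, bytes):
                key = key.decode()
            names.append(key[len(self.PREFIX):])
        return sorted(names)
```

A worker that crashes simply stops refreshing its key and drops out of `live_workers()` once the TTL passes. The trade-off mentioned earlier in the thread still applies: this makes Redis a hard dependency for any worker deployment.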

clokep added enhancement z-p3 (Deprecated Label) A-Workers Problems related to running Synapse in Worker Mode (or replication) labels Aug 13, 2020

DMRobertson added T-Enhancement New features, changes in functionality, improvements in performance, or user-facing enhancements. and removed z-enhancement labels Aug 25, 2022

dklimpel mentioned this issue Oct 26, 2022

Add workers settings to configuration manual #14086

Merged

4 tasks

matrixbot mentioned this issue Dec 21, 2023

Remove the need for worker_name to simplify scaling element-hq/synapse#8084

Open
