Distr Chat

Horizontally scalable and high availability real-time messaging system.

Running the Project

The whole project can be run with a single docker compose up command.

Tested only with Windows 11, Docker Engine v20.10.22, and Docker Compose v2.51.1.

Install and run Docker if not already installed. For Windows, you will also need WSL 2 as described in the Docker documentation (Hyper-V probably works too, but it is not tested). If your Docker installation did not come with the Docker Compose plugin, you need to install that too.
Clone this repository:

git clone https://github.com/VilleKylmamaa/Distributed-Systems-Project.git

Open command prompt in the root folder (Distributed-Systems-Project)
Create a Docker network:

docker network create --subnet=177.17.0.0/16 distr_network

Run the project:

docker compose up

The images pulled will be .NET 6.0, Redis 7.0.8, and Node:18.14.0.

If the frontend has problems running in Docker, you may enter the ./Frontend folder, first run npm install, and then run npm run dev. At one point, this was an issue on some PCs but not all, but it should be fixed now.

Open Docker Desktop (this guide is for Docker Desktop, you may also use the CLI if you like), and open/expand the "distr_chat" multi-container.

If everything launched correctly, the view should be like this (sorted by name):

The containers running should be: 3 application servers, 3 backplanes, 3 load balancers, and a single frontend.

The container named Backplane_Cluster_Starter has exited on purpose. This container is used only to execute a command to activate Redis Cluster mode in the 3 backplane nodes, and it exits on success.

Observable Functionality

To open the different user interfaces, you can simply click the Port links on Docker Desktop. You can find the nodes running in these ports:

Frontend: http://localhost:3001/
Application_Server_1: http://localhost:5001/
Application_Server_2: http://localhost:5002/
Application_Server_3: http://localhost:5003/
Load_Balancer_1: http://localhost:7001/
Load_Balancer_2: http://localhost:7002/
Load_Balancer_3: http://localhost:7003/

The backplane containers have no UI.

User Interfaces

Type a username and a room name to enter a chat room. It's a typical chat interface. I got tired of typing messages when testing, so there is a helpful Spam :) button to quickly send messages.

The application server UI keeps track of the rooms and their user counts, and the Redis backplane connections:

The load balancer UI keeps track of the availability of the application servers and their WebSocket connetion counts. The events when a frontend client is connected to an application server are also logged.

If you like seeing lots of words, you may also check the Docker logging in the command prompt with which you ran docker compose up to follow what is happening in the system. In the log, you may spot this backplane warning: "WARNING Memory overcommit must be enabled!...". This warning could be fixed by enabling overcommit on your host machine (i.e., not inside Docker - impossible because of privileges), but it does not affect testing this system and you should just ignore it.

Observing the Load Balancer Algorithm Working

The load balancers always connect the client to the application server with the lowest number of current connections. You may observer this happen by opening multiple frontend browser tabs and joining chat rooms. For example, open 3 tabs and join chat (same or different rooms - doesn't matter). Then, leave and rejoin with a client either connected to app server 1 or 2. You will observe that the clients are always connected to the app servers with 0 connections. Only with 4 tabs you will be able to get 2 connections in the same app server.

Observing the Backplane Working

Notice that as long as the users have joined the same room, they will receive each other's messages even though they are connected to different application servers.

Observing the Fail-Overs and States of Failure

Stop and relaunch different nodes in Docker Desktop by clicking the stop or start buttons:

If there is at least one node of each type running (1 application server, 1 load balancer, 1 backplane), you should observe the frontend chat still available and working.

When killing backplane nodes, there might be a few seconds during which you can't send messages. When you kill application nodes, the clients connected to the killed node will get thrown out of the chat rooms.

This is still a working state:

If you kill load balancer nodes 1 and 2, you will notice the client takes longer to connect as it always tries to contact load balancer nodes 1 and 2 first.

If you kill all the load balancer nodes, the frontend client will not be able to connect to an application server. The frontend client will attempt to contact each load balancer 3 times, before giving up after all the attempts timeout.

If you kill either all the application server nodes or the backplane nodes, the load balancer will give an error to the client and log: "Client attempted to connect while all app servers were unavailable".

After killing all the nodes of a certain type, and then restarting any of them, you should observe the system return to a functioning state (an available chat service).

You may observe the application server and load balancer UIs update as you stop and restart nodes, to see what a specific application server or load balancer knows about the state of the system. If those UIs do not update as you would expect, its likely a bug in updating the UI (try refreshing the page, F5) and the underlying system is still working as expected.

Design Documentation

This distributed systems project is a chat application which is designed as a distributed system in order to enable high availability and horizontal scaling. The frontend is kept as a simple chat in order to focus on the backend system design and architecture. This same setup could of course be used for other types of applications which utilize real-time updates for a high amount of concurrent users.

The chat will connect to an application server with a WebSocket connection in order to achieve real-time updates. However, the number of WebSocket connections is quite limited for a single server. It often becomes the limiting factor before the message throughput rate does. Therefore, the system should be designed in such a way that the number of application servers can be increased to match the load. In other words, the application servers should be horizontally scalable.

Another reason for running the same service on multiple servers is achieving high availability. That is, the system should be as fault tolerant as possible in order to minimize potential downtime of the service. To achieve this, it should be possible to run replicas of each part of the system so that there is no single-point-of-failure in the system. The system should continue running if any of the different types of nodes in the system fail.

High availability and fault tolerance are the main focus points of the project. To prove horizontal scalability having access to multiple real servers is required. Thus, while horizontal scalability is a motivational factor in this project, its proof is left as an implication of being capable of spreading the WebSocket connections over multiple nodes.

Architecture and Design

In the system, a client may connect to any one of the application servers through a load balancer, and the client should still receive messages sent by a client which is connected to any different application server. This is not possible without another type of node or some other solution to relay messages between the application servers because the application server can only send messages to clients which are connected to it. In this project, I will utilize a backplane server as a message broker between the application servers. Thus, there will be 4 different types of nodes, as I have depicted in the system architecture diagram below.

Frontend Client

Implemented with: TypeScript, React framework.

Frontend is the chat application visible to the user. There will be as many client nodes as there will be users. First, the user will input an username and a chat room name. In the chat room, the user can send text messages and will see messages sent by other users in the same room.

Load Balancer

Implemented with C#, .NET Core application.

The purpose of the load balancer is to spread the clients' connections evenly between the application servers. The load balancer communicates with the application servers to receive data of the application server's current load. It will make the decision about which application server to connect the user to based on the load. Load in the algorithm being the number of WebSocket connections to the server. The load balancer will connect the client to the application server with the lowest load (or any server tied for the lowest load).

Application Server

Implemented with: C#, .NET Core application. Handling of the WebSocket connections and sending messages with SignalR.f

The application server handles the main server side business logic of the application. It will hold the WebSocket connections from the clients, through which it will receive messages from the clients and publish those messages to the correct chat rooms. The application server will also track the number of users in each chat room, and importantly for the load balancer, the total count of connections.

I consider that there is no need for persistence for the messages. The system is interested purely in real-time communication. If there was a need, a database or a data stream (Apache Kafka, Redis Stream, etc.) depending on the persistence requirements could be incorporated.

Backplane

Implemented with: Redis.

The purpose of the backplane is to relay messages between the application servers. It is essentially a message broker. There is support for a backplane implementation in SignalR, options including for example Redis, Azure, and SQL Server backplane-options. I chose Redis as the backplane because it is open source and, as opposed to cloud options, you can set up your own Redis servers.

I researched the ways to achieve high availability in Redis. In Microsoft's .NET documentation entitled "Set up a Redis backplane for ASP.NET Core SignalR scale-out", everything that was originally written about Redis Cluster was that "Redis Clustering is a method for achieving high availability by using multiple Redis servers. Clustering isn't officially supported, but it might work.". The last sentence was changed to "Clustering is supported without any code modifications to the app" in January 2023 (source).

However, researching further I found out that the message throughput (messages per second) of Redis Pub/Sub decreases as the number of nodes in the cluster is increased (source: video) (source: presentation slides). SignalR uses Redis Pub/Sub in any Redis backplane option under the hood. Thus, a high availability setup where there is only a single active backplane node would be more optimal. I opened up an issue in the aspnetcore GitHub asking for an update to the documentation (source). I also gave a feature suggestion that the sharded Pub/Sub released in the Redis version 7.0 in April 2022 could be utilized to possibly achieve horizontal backplane message throughput scaling for SignalR. Following the issue, I also opened a pull request to update the Redis Cluster section in the SignalR Redis documentation to inform about the message throughput trade-off and to guide in the configuration for the Cluster (source). This pull request was merged on February 10, 2023, making me one of the contributors in the documentation article, specifically the "Redis Cluster" section.

What then if the number of messages delivered by the backplane decreases with the number of nodes in the Redis Cluster, can it still be used as the backplane? In benchmarks by Redis the messages throughput was still 100 000 messages per second for a Cluster of 3 active nodes (source: video) (source: presentation slides). I will research further if Redis Sentinel or simple replication is compatible as a backplane for SignalR. If compatibility with SignalR would require forking the repository of SignalR, Redis, or the underlying dependencies, I will consider this outside the scope of this project, as the Redis Cluster will still satisfy the main goal of the project - high availability - and a message throughput of 100 000 per second is high enough for many systems. In the discussion of the GitHub issue it was mentioned that the SignalR implementation uses a library called StackExchange.Redis which should first support sharded Pub/Sub before SignalR tries to utilize it (source).

Naming

The application server handles any logic related to naming. WebSocket connections have unique identifiers so that the total count of connections and users in each chat room can be tracked. Messages appointed to a specific chat room will only be broadcasted to the room with that specific name. A unique prefix will be used for the communication through the backplane so that the messages of this system would never be confused with the messages of another module which might use the same backplane in the future.

Coordination

The coordination in the system which I will implement is the load balancer communicating with the application servers to gain information about their current active connection count and connect new clients based on that information. The Redis Cluster or the other potential Redis options have their own in-built coordination.

Consistency and Replication

Each node will be replicable. However, due to no persistent data in the system, consistency is mainly not a concern. The only type of consistency visible to the client will be the order of the chat messages. That is, different clients could receive the same messages in different order. The application server which first receives the message will timestamp the messages and the correct order or the messages will be decided by these timestamps. If the messages are received on the exact same millisecond then I make the decision to not care which message gets shown first. I consider the price for the messages being in a wrong order in a rare case acceptable. Having lax consistency requirements lets us focus on availability and fault tolerance (CAP theorem).

Security

I consider security not to be a focus of this project. For future work, the nodes should require API keys for communication, the messages could be encrypted in the backend, and the users could have authentication in the frontend.

Evaluation - High Availability, Fault Tolerance

High availability and fault tolerance are the main evaluation criteria of the system. Testing and showcasing the system will be accomplished by stopping and restarting the Docker containers to mock server failures in all the different types of nodes (frontend, load balancer, application server, backplane). As long as not all the nodes of a single type are killed, you should be able to kill any node and the system will continue running seemingly without fault to the client, other than possibly a few seconds hiccup before connection to other node replaces the fallen node. The chat service would be unavailable only if all the nodes in one of the four categories are killed.

As mentioned in the introduction, horizontal scalability was one of the motivations for this project but due to need of real separate servers its proof is left as an implication of being able to replicate the application servers. In other words, multiple servers should be able to hold more WebSocket connections than a single server. Making the message throughput of the Redis backplane horizontally scalable I consider to be future work due to the dependency mentioned with StackExchange.Redis.

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
Application Server		Application Server
Backplane		Backplane
Frontend		Frontend
Load Balancer		Load Balancer
_Documentation		_Documentation
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distr Chat

Running the Project