[Messages] Sync job #285

Open
wants to merge 3 commits into dev

Conversation

odesenfans
Contributor

Added a new job that synchronizes unconfirmed messages across
the network. The goal of this job is to re-send messages missed
by nodes with the ability to push data on-chain. This can happen
because of various issues like server downtime or bugs.

This job works in three parts:

  • the publisher task periodically sends the list of all the messages
    older than the last TX block that have yet to be confirmed by
    on-chain data.
  • the receiver task stores the list of unconfirmed messages for
    each peer detected on the network.
  • the sync/aggregator task aggregates the confirmation data from
    all the nodes and fetches messages using the HTTP API.
    These messages are added to the pending message queue.
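
As a rough illustration, the sync/aggregator part could look like the sketch below. All names here (`unconfirmed_hashes_by_peer`, `fetch_message`, `add_pending_message`, the endpoint path, the job period constant) are illustrative assumptions, not the actual implementation:

```python
import asyncio
from typing import Awaitable, Callable, Dict, Set

import aiohttp

JOB_PERIOD = 300  # seconds; matches the 5-minute period mentioned below

# Hypothetical in-memory store filled by the receiver task:
# peer ID -> set of message hashes that peer still reports as unconfirmed.
unconfirmed_hashes_by_peer: Dict[str, Set[str]] = {}


async def fetch_message(session: aiohttp.ClientSession, api_url: str, item_hash: str) -> dict:
    # Illustrative only: fetch the full message from a peer's HTTP API.
    async with session.get(f"{api_url}/api/v0/messages/{item_hash}") as resp:
        resp.raise_for_status()
        return await resp.json()


async def sync_aggregator_task(
    peer_apis: Dict[str, str],
    add_pending_message: Callable[[dict], Awaitable[None]],
) -> None:
    """Aggregate unconfirmed hashes from all peers and queue the missing messages."""
    async with aiohttp.ClientSession() as session:
        while True:
            # Union of everything any peer still considers unconfirmed.
            all_hashes: Set[str] = set()
            for hashes in unconfirmed_hashes_by_peer.values():
                all_hashes |= hashes

            for item_hash in all_hashes:
                for peer_id, api_url in peer_apis.items():
                    try:
                        message = await fetch_message(session, api_url, item_hash)
                    except aiohttp.ClientError:
                        continue  # this peer cannot serve the message, try the next one
                    await add_pending_message(message)
                    break

            await asyncio.sleep(JOB_PERIOD)
```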

This solution is less expensive than constantly sharing all
the messages across all the nodes, and it guarantees that the network
will eventually be synchronized as long as the on-chain data
synchronization jobs are working. With the current implementation,
a message can remain out of sync at most until a new TX
is published on-chain plus the job period (currently 5 minutes).

Fixed an issue where the pending message job would block on
the final messages in the queue and stop processing newer messages.

Once the job finishes the loop on all the messages in the pending
message collection, the previous implementation waits until all
the message tasks finish. This can delay the node by several hours
before it is able to process newer pending messages again.
Messages end up being processed, but far later than expected.

The issue arises because we never remove messages from the pending
queue if we fail to retrieve the associated content, so the job
always has messages left in the queue to wait on.

Fixed the issue by allowing the loop to restart without waiting
for all messages to be processed. We now compute an individual ID
for each pending message and add it to a set. The job simply
ignores any message that is already being processed, allowing
newer messages to be taken into account.
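
As a rough sketch of the fix (names like `pending_message_id`, `fetch_pending` and `handle_pending` are assumptions, not the actual code), the loop can track the set of IDs currently being processed and skip them instead of waiting for every task to finish before restarting:

```python
import asyncio
from typing import Set


def pending_message_id(pending: dict) -> str:
    # Hypothetical ID: combine the message hash with the source that queued it,
    # so that the same pending entry always maps to the same identifier.
    return f"{pending['message']['item_hash']}:{pending.get('source', '')}"


async def process_pending_messages(fetch_pending, handle_pending) -> None:
    in_flight: Set[str] = set()  # IDs of messages currently being processed

    while True:
        for pending in await fetch_pending():
            message_id = pending_message_id(pending)
            if message_id in in_flight:
                # Already being handled (e.g. its content is still unavailable);
                # skip it so the loop can move on to newer messages.
                continue

            in_flight.add(message_id)
            task = asyncio.create_task(handle_pending(pending))
            # Forget the ID once the task completes, whatever the outcome,
            # so the message can be retried on a later pass.
            task.add_done_callback(lambda _t, mid=message_id: in_flight.discard(mid))

        # Restart the loop without awaiting the individual tasks.
        await asyncio.sleep(1)
```

The point is that the loop never waits for the per-message tasks to finish; it only remembers which messages are already in flight so they are not queued twice.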