Topological order update #122

ra1028 · 2024-04-23T08:54:05Z

Pull Request Type

Description

Problem

Currently, when an atom is updated, its downstream is transitively updated in an unspecified order. In addition, if the same dependent (atom or subscription) appears multiple times in the dependency graph, it is updated once per occurrence.
This not only causes redundant recomputation of atoms and redundant view updates, but can also cause glitches, because an atom may be updated while some of its dependencies are still out of date.

For example, the following case:

graph TD;
    A-->B;
    A-->C;
    B-->D;
    C-->D;
    C-->E;
    D-->E;

In this case, the worst update order would be:
A -> B -> D -> E -> C -> D -> E -> E
which updates E three times and D twice. The first update of D consumes an outdated state of C, since C has not been updated yet, and E in turn consumes outdated values of both C and D.
This is a significant performance issue for apps that use Atoms for state management.
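To make the problem concrete, here is a minimal sketch of a naive transitive update on the example graph. It is not the library's code: the `children` dictionary, the `String` vertex names, and `naiveUpdateOrder` are hypothetical, and the child order is chosen to reproduce the worst case above.

```swift
/// Downstream adjacency of the example graph (hypothetical representation).
let children: [String: [String]] = [
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": ["E"],
]

/// Naive transitive update: every edge triggers a full downstream pass,
/// so a vertex is updated once per path that reaches it.
func naiveUpdateOrder(from root: String) -> [String] {
    var order = [String]()

    func update(_ vertex: String) {
        order.append(vertex)
        for child in children[vertex, default: []] {
            update(child)
        }
    }

    update(root)
    return order
}

let order = naiveUpdateOrder(from: "A")
print(order.joined(separator: " -> "))
// A -> B -> D -> E -> C -> D -> E -> E
```

D is first updated before C, and E is updated once for each of the three paths that reach it.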

Solution

Introduce a topological sorting algorithm that gives the correct order for transitive updates.
In the example given above, the topologically sorted order will be:
A -> B -> C -> D -> E
or
A -> C -> B -> D -> E
Neither order contains redundant transitive updates, and both guarantee that all of an atom's dependencies are updated before the atom itself.
Since nothing determines the relative order of B and C, which of the two orders is produced is arbitrary.

Algorithm

DFS topological sorting [ref].

DFS: (Depth-first search)
BFS: (Breadth-first search)
Vertex: represents a node of a graph such as atom or subscription(view)
Edge: represents a relation between vertex and vertex, such as atom to atom or atom to subscription

DFS fits the library's data structure better than BFS.
Although DFS implementations typically use recursive functions and are therefore vulnerable to stack overflows, atom dependencies cannot realistically be that deep. Also, compiling with Swift's -O should optimize the recursive function into something equivalent to a while-loop implementation.

Normally, DFS stops traversing downstream at a vertex that has already been traced. This implementation does not stop: it always traverses to the end of the graph, but leaves the redundant edges out of the sorted array. This is necessary so that the redundant edges can be re-evaluated later.
In the example above, the sorted order is A -> B -> C -> D -> E. Suppose that when D is updated, its new value turns out to equal the old one, so there is no need to update its downstream (assuming D is an atom with the changes modifier). Even then, E, which is downstream of D, must still be transitively updated because it also depends on C; otherwise E would not reflect the changes of C.
To handle this, the redundant edges omitted during sorting are collected, and when the above situation occurs, the recorded edges are checked for the target vertex to determine whether it still needs updating.
The redundant edges in the example are:

B -> D
C -> E
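The approach described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the code in TopologicalSort.swift: `Edge`, the `String` vertices, `topologicalSorted(from:children:)`, `verticesToUpdate`, and the `unchanged` input are all hypothetical names. The child order of A is chosen so that the sketch yields the order A -> B -> C -> D -> E used in the description.

```swift
/// A directed edge between two vertices (hypothetical stand-in type).
struct Edge: Hashable {
    let from: String
    let to: String
}

/// DFS topological sort that keeps traversing already-traced vertices and
/// records, per vertex, the upstreams of the edges left out of the result.
func topologicalSorted(
    from root: String,
    children: [String: [String]]
) -> (edges: [Edge], redundant: [String: [String]]) {
    var traced = Set<String>()
    var postOrder = [Edge]()
    var redundant = [String: [String]]()

    func visit(_ vertex: String, from parent: String, isRedundant: Bool) {
        // Redundant if the target was already traced, or if we arrived
        // through an edge that was itself redundant.
        let isRedundant = isRedundant || traced.contains(vertex)
        traced.insert(vertex)

        // Do not stop here even when the vertex was already traced.
        for child in children[vertex, default: []] {
            visit(child, from: vertex, isRedundant: isRedundant)
        }

        if isRedundant {
            redundant[vertex, default: []].append(parent)
        } else {
            postOrder.append(Edge(from: parent, to: vertex))
        }
    }

    for child in children[root, default: []] {
        visit(child, from: root, isRedundant: false)
    }
    // Reverse post-order of the first visits is a topological order.
    return (edges: Array(postOrder.reversed()), redundant: redundant)
}

/// Walks the sorted edges and returns the vertices that get updated.
/// `unchanged` holds vertices whose new value equals the old one.
func verticesToUpdate(
    edges: [Edge],
    redundant: [String: [String]],
    unchanged: Set<String>
) -> [String] {
    var skipped = Set<String>()  // vertices that did not propagate a change
    var updated = [String]()

    for edge in edges {
        // Still update when any redundant upstream did propagate a change.
        let upstreamPropagated = !skipped.contains(edge.from)
            || redundant[edge.to, default: []].contains { !skipped.contains($0) }

        if !upstreamPropagated {
            skipped.insert(edge.to)
            continue
        }
        updated.append(edge.to)
        if unchanged.contains(edge.to) {
            skipped.insert(edge.to)
        }
    }
    return updated
}

let children = ["A": ["C", "B"], "C": ["D", "E"], "B": ["D"], "D": ["E"]]
let result = topologicalSorted(from: "A", children: children)
print(result.edges.map(\.to))  // ["B", "C", "D", "E"]

// D's value did not change, but E is still updated because C changed.
let updated = verticesToUpdate(
    edges: result.edges, redundant: result.redundant, unchanged: ["D"])
print(updated)  // ["B", "C", "D", "E"]
```

In this sketch the recorded upstreams are B for D, and C and D for E: the B -> D and C -> E edges listed above, plus D -> E, which is reached a second time through B. If both B and C are unchanged, neither D nor E is updated.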

Performance

The complexity of the algorithm is O(|V|+|E|), where V is the set of vertices and E is the set of edges, i.e. linear time.
It is conceptually a bit slower than the current implementation, but my benchmark results show that in general cases both wall-clock time and memory usage are almost the same as before. It also eliminates other causes of poor performance, such as redundant builds of atoms and views, which matter most for app performance.

Benchmark test

rasberik · 2024-04-29T08:10:21Z

Sources/Atoms/Core/TopologicalSort.swift

+
+/// DFS topological sorting.
+@MainActor
+internal func topologicalSort(key: AtomKey, store: AtomStore) -> (


This is a global main-actor function just to access the main-actor-isolated store.graph. Why not make it an extension of AtomStore, which is already on the main actor? (I'm not a big fan of global functions with a specific context.)

[nit]
Also, do you think topologicalSort is a proper name given it returns edges? I'd suggest naming it after what the call site (update(atom...)) needs, e.g. childrenToUpdate() -> edges, redundants, etc., and then hiding the specific technique in something like topologicallySorted() -> edges.

This is a global main-actor function just to access the main-actor-isolated store.graph. Why not make it an extension of AtomStore, which is already on the main actor?

Sounds good to me.

do you think topologicalSort is a proper name given it returns edges?

I believe how it is sorted is important for the caller, as the additional algorithm in StoreContext only works with topological sorting.
Also, no other name would justify returning the pair of edges and redundants.
A more descriptive name like topologicallySortedEdgesWithRedundantPath would make it clear, but it sounds like too much.

I'm already doing a bunch of refactoring in another branch, so I will apply it there.

rasberik

Looks good to me, left a few nit comments.

shingt · 2024-04-30T04:00:51Z

Sources/Atoms/Core/StoreContext.swift

+        // Perform side effects first.
+        let state = getState(of: atom, for: key)
+        let context = AtomCurrentContext(store: self, coordinator: state.coordinator)
+        atom.updated(newValue: newValue, oldValue: oldValue, context: context)


Q: Does this mean we're changing the timing of the side effects, so that any usage accidentally relying on the current order (side effects performed at the end of this method) might be affected?

Yes. Before the change, the side effect ran after the downstream had finished updating; now it runs immediately after the atom's value is updated.
The order in which atoms were updated was never guaranteed in the first place, so it's not possible that they were used in a way that depended on the update order. (Even if they were, it wouldn't have worked properly.)

The order in which atoms were updated was never guaranteed in the first place, so it's not possible that they were used in a way that depended on the update order. (Even if they were, it wouldn't have worked properly.)

Understood, thank you!

ra1028 added 9 commits April 24, 2024 19:14

Implement topological order update algorithm

a6adf5d

Refactoring

f3ffb28

Refactoring

ca18408

Track skipping atoms accurately

bc78787

Add test cases

080e94a

Refactoring

015b171

Support for complex cases of when transitive update is skipped

d98d473

Refactoring

e131fa5

Add more strict testing

bb79c39

ra1028 force-pushed the feat/topological-sort-update branch from 5d6b546 to bb79c39 Compare April 25, 2024 07:43

ra1028 added 2 commits April 25, 2024 19:06

Show all diffs when validation fails

e7ad694

Fix dev tool cache

b84abaa

ra1028 marked this pull request as ready for review April 25, 2024 11:04

ra1028 added 2 commits April 25, 2024 23:54

Refactoring

d4e0849

Update test case to be more strict

578e108

ra1028 force-pushed the feat/topological-sort-update branch 3 times, most recently from 13998f0 to cdc8bc7 Compare April 25, 2024 18:52

Refactoring

f90fd3c

ra1028 force-pushed the feat/topological-sort-update branch from cdc8bc7 to f90fd3c Compare April 26, 2024 06:26

Add test for topological sort function

d4a68a5

rasberik reviewed Apr 29, 2024

rasberik approved these changes Apr 29, 2024

shingt reviewed Apr 30, 2024

ra1028 merged commit 1bc21c6 into main Apr 30, 2024

ra1028 deleted the feat/topological-sort-update branch April 30, 2024 06:24

4 participants
