SQL - utilise JSON aggregations for relationships #14532

mike12345567 · 2024-09-06T09:48:26Z

Description

This PR addresses several issues with loading large numbers of relationships in Budibase, primarily documented in #13820.

We also have a Linear ticket, https://linear.app/budibase/issue/BUDI-8311/rows-in-a-table-go-missing-when-multiple-relationships-are-attached-to, which discusses the change this PR implements: using JSON aggregations to retrieve relationships rather than joining them onto the main table.

The issue with our old method is that as the number of "many" relationships grows, the size of the response data grows exponentially. For example, if a table has 10 "many" relationships attached to it, each with 1000 rows related to a single row in the main table, the response is not 10,001 rows (1000 rows for each of the 10 relationships, plus 1 row for the main table row). It is somewhere in the region of 1000 to the power of 10 rows (10^30), because each join multiplies the result of the previous ones. This is an unavoidable consequence of joining. Usually it would be avoided by simply not joining everything, but that is not how Budibase works: we need all relationship data available for the page so that we can calculate formulas (amongst other requirements, such as relational filtering).
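To make the arithmetic concrete, here is a minimal sketch using Python's built-in `sqlite3` with deliberately tiny numbers (3 relationships, 4 related rows each). The table names `main` and `rel0`..`rel2` are hypothetical and the query is not the SQL Budibase generates; it only illustrates how joins multiply.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

RELATIONSHIPS = 3  # hypothetical number of "many" relationships
ROWS_PER_REL = 4   # hypothetical related rows per main-table row

cur.execute("CREATE TABLE main (id INTEGER PRIMARY KEY)")
cur.execute("INSERT INTO main (id) VALUES (1)")

for r in range(RELATIONSHIPS):
    cur.execute(f"CREATE TABLE rel{r} (id INTEGER PRIMARY KEY, main_id INTEGER)")
    # Every related row points at the single main row.
    cur.executemany(
        f"INSERT INTO rel{r} (main_id) VALUES (?)",
        [(1,)] * ROWS_PER_REL,
    )

# Join every relationship directly onto the main table.
joins = " ".join(
    f"LEFT JOIN rel{r} ON rel{r}.main_id = main.id" for r in range(RELATIONSHIPS)
)
joined_rows = cur.execute(f"SELECT COUNT(*) FROM main {joins}").fetchone()[0]

# The number of distinct rows that actually exist across all tables.
distinct_rows = 1 + RELATIONSHIPS * ROWS_PER_REL

print(joined_rows)    # 64, i.e. 4 ** 3
print(distinct_rows)  # 13, i.e. 1 + 3 * 4
```

With the figures from the example above (10 relationships, 1000 rows each) the same product is `1000 ** 10`, which equals `10 ** 30`.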

Even if we do not return the full result set, we still hit this problem, as the sheer amount of data being joined can cause the query to never respond. We see this when working with SQS, where heavily related data can cause threads to stop responding.

With this fix the main table never has any joins on it: all joins are singular and live inside sub-queries. This means the response time will always scale as O(n) with the number of rows that need to be joined, and nothing more. The sub-queries can retrieve a large number of rows from the related table (up to our hard limit) and then merge them, using JSON aggregation functions, into a single column per row. This works well because it is what Budibase would normally do with the data set anyway.
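As a rough illustration of the shape of such a query, the sketch below uses SQLite's `json_group_array` and `json_object` through Python's `sqlite3` module. The schema (`authors`, `books`, `awards`) and the `MAX_RELATED_ROWS` cap are assumptions made for the example, and the SQL is hand-written rather than taken from this PR; other databases have their own JSON aggregation functions.

```python
import json
import sqlite3

MAX_RELATED_ROWS = 2  # hypothetical cap, standing in for the hard limit

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE books (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
CREATE TABLE awards (id INTEGER PRIMARY KEY, author_id INTEGER, label TEXT);
INSERT INTO authors VALUES (1, 'Ann'), (2, 'Bob');
INSERT INTO books VALUES (1, 1, 'A1'), (2, 1, 'A2'), (3, 1, 'A3'), (4, 2, 'B1');
INSERT INTO awards VALUES (1, 1, 'Gold'), (2, 1, 'Silver');
""")

# The main table is selected with no joins. Each relationship is a
# correlated sub-query that caps the related rows and folds them into
# one JSON array column.
query = """
SELECT
  a.id,
  a.name,
  (SELECT json_group_array(json_object('id', b.id, 'title', b.title))
     FROM (SELECT id, title FROM books
            WHERE author_id = a.id ORDER BY id LIMIT :cap) AS b) AS books,
  (SELECT json_group_array(json_object('id', w.id, 'label', w.label))
     FROM (SELECT id, label FROM awards
            WHERE author_id = a.id ORDER BY id LIMIT :cap) AS w) AS awards
FROM authors AS a
ORDER BY a.id
"""

# Parse the JSON columns back into Python lists, one dict per main row.
rows = [
    {"id": i, "name": n, "books": json.loads(b), "awards": json.loads(w)}
    for i, n, b, w in cur.execute(query, {"cap": MAX_RELATED_ROWS})
]

for row in rows:
    print(row["name"], len(row["books"]), len(row["awards"]))
```

The result has exactly one row per author regardless of how many relationships are attached, and a relationship with no related rows comes back as an empty array rather than removing the main row.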

This also fixes some issues we have with pagination: when we limit the main query to 100, we will always get 100 rows from the main table if that many rows exist. This improvement also helps us implement some other features.
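The pagination point can be sketched in the same way. The example below is a simplified illustration, not a description of how the previous implementation applied its limits: it shows that a `LIMIT` on a joined result counts joined rows, whereas a `LIMIT` on a join-free main query counts main-table rows. The table names and `PAGE_SIZE` are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
CREATE TABLE main (id INTEGER PRIMARY KEY);
CREATE TABLE related (id INTEGER PRIMARY KEY, main_id INTEGER);
""")
cur.executemany("INSERT INTO main (id) VALUES (?)", [(i,) for i in range(1, 6)])
# Every main row gets three related rows.
cur.executemany(
    "INSERT INTO related (main_id) VALUES (?)",
    [(i,) for i in range(1, 6) for _ in range(3)],
)

PAGE_SIZE = 3  # hypothetical page size

# LIMIT applied to a joined result: the page fills up with repeated main rows.
joined = cur.execute(
    "SELECT main.id FROM main "
    "LEFT JOIN related ON related.main_id = main.id "
    "ORDER BY main.id LIMIT ?",
    (PAGE_SIZE,),
).fetchall()
main_rows_via_join = {row[0] for row in joined}

# LIMIT applied to a join-free main query with an aggregating sub-query.
aggregated = cur.execute(
    "SELECT main.id, "
    "(SELECT json_group_array(r.id) FROM related AS r WHERE r.main_id = main.id) "
    "FROM main ORDER BY main.id LIMIT ?",
    (PAGE_SIZE,),
).fetchall()
main_rows_via_subquery = {row[0] for row in aggregated}

print(len(main_rows_via_join))      # 1
print(len(main_rows_via_subquery))  # 3
```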

samwho

Went through this on a call with Mike just now, approval in principle. Just a few little things to look into that we've left in comments throughout.

mike12345567 and others added 30 commits August 23, 2024 18:00

Implementing a JSON aggregate method of selecting relationships.

ab5f50d

Getting processing of SQS relationships working.

80f3e59

Getting fields from all relationships loading correctly.

5d53e64

Adding limit in for wide tables to be related correctly.

b11ee56

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

181aa24

Moving things around, making join logic more accessible.

0c604b7

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

413628c

Saving at this point - got exists working.

49c1f34

Getting through join working as expected.

6289643

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

b217e83

Check for alias as well when deciding whether filter requires relationship addition.

3e51dde

Some improvements to get SQS tests passing.

a9b1a22

Adding the option to disable user sync, always importing large apps which are problematic.

6730105

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

6407f5b

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

fc31a28

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

7e7e23d

Fixing an issue with inconsistent relationship order.

ac7838f

Work to support all SQL DBs across the board using the aggregation method.

b29a4e2

Correcting test cases.

2a24a3d

Fix for sorting, didn't account for some primitive types.

2d6a8d9

Linting.

fed82df

Fix for generic sql test.

eefb1f0

Updating to use a sub-query with a wrapper to get the JSON aggregations.

79de7b2

Revert to testing against mssql 2017, attempt to get relationship aggreggation working.

12db645

Resolve merge conflicts.

e90aff9

Fix some MSSQL test cases.

cda7785

Merge pull request #14515 from Budibase/sql-server-aggregation-backwards-compat

fa2611b

Test against SQL Server 2017, get JSON aggregation of relationships working under 2017.

Slight refactor.

637ac55

Getting MariaDB to work again.

e30469c

Fixing aliasing test cases.

7cdf813

mike12345567 added 3 commits September 5, 2024 18:12

Fixing SQL unit tests.

888c421

Updating select statement generation.

f7d9b8a

Merge pull request #14509 from Budibase/aggregate-all-sql-dbs

aac1d76

Apply new relationship retrieval to all SQL DBs

mike12345567 self-assigned this Sep 6, 2024

mike12345567 requested a review from a team as a code owner September 6, 2024 09:48

mike12345567 requested review from adrinr and removed request for a team September 6, 2024 09:48

github-actions bot added the firestorm (Data/Infra/Revenue Team) and size/xl labels Sep 6, 2024

mike12345567 commented Sep 6, 2024

packages/backend-core/src/sql/sql.ts (resolved)

mike12345567 commented Sep 6, 2024

packages/backend-core/src/sql/sql.ts (resolved)

mike12345567 commented Sep 6, 2024

packages/server/src/api/controllers/row/utils/basic.ts (resolved)

mike12345567 commented Sep 6, 2024

packages/server/src/api/routes/tests/queries/generic-sql.spec.ts (resolved)

samwho approved these changes Sep 6, 2024

mike12345567 added 2 commits September 9, 2024 16:07

Merge branch 'master' into fix/sql-many-relationships

e2c6893

Merge branch 'master' into fix/sql-many-relationships

8e8946b

adrinr approved these changes Sep 10, 2024

packages/server/datasource-sha.env (resolved)

mike12345567 added 9 commits September 10, 2024 12:12

Merge branch 'master' of github.com:Budibase/budibase into fix/sql-many-relationships

5e80a97

Adding SQL_MAX_RELATED_ROWS environment variable, defaults to 500, allows for 500 rows per relationship.

f63c95e

Handling JSON types within relationships, they need to be parsed as well.

2fd5c1a

Linting.

d1b12b8

Updating test case.

86a6664

Adding test case for getting related array column in a JS formula.

1582e32

More incorrect limits.

9a61ec5

Fix for test case.

595dd7e

Merge branch 'master' into fix/sql-many-relationships

fa6058c

mike12345567 merged commit 31f8691 into master Sep 11, 2024
11 of 12 checks passed

mike12345567 deleted the fix/sql-many-relationships branch September 11, 2024 09:33

github-actions bot locked and limited conversation to collaborators Sep 11, 2024
