
Non-windowed updating aggregates using datafusion. #588

Merged

merged 1 commit into master on Apr 22, 2024

Conversation

jacksonrnewhouse
Contributor

@jacksonrnewhouse commented Apr 12, 2024

The main functionality this provides is the ability to run aggregates without windows, emitting update and retract messages that can be written to a Debezium sink.

The logic for calculating the aggregate lives in UpdatingAggregatingFunc. This operator has three versions of the aggregate exec, one for each of three modes (a toy sketch of the three-stage flow follows the list):
Partial: Takes input and emits partial aggregate representations.
CombinePartial: Merges multiple partials into a single partial. This mode was added in https://github.com/ArroyoSystems/arrow-datafusion/pull/1/files.
Final: The final aggregation that finishes any aggregates, expecting partials as input.
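As a toy illustration (not the DataFusion API, and with names invented for the example), the relationship between the three modes looks like this for a simple average:

```rust
// Toy model of the three modes for AVG: Partial turns raw rows into a partial
// state, CombinePartial merges partial states, and Final produces the result.
#[derive(Clone, Copy, Default)]
struct AvgPartial {
    sum: f64,
    count: u64,
}

// Partial: takes input and emits a partial aggregate representation.
fn partial(batch: &[f64]) -> AvgPartial {
    AvgPartial { sum: batch.iter().sum(), count: batch.len() as u64 }
}

// CombinePartial: merges multiple partials into a single partial.
fn combine_partial(a: AvgPartial, b: AvgPartial) -> AvgPartial {
    AvgPartial { sum: a.sum + b.sum, count: a.count + b.count }
}

// Final: finishes the aggregate, expecting partials as input.
fn final_value(p: AvgPartial) -> Option<f64> {
    (p.count > 0).then(|| p.sum / p.count as f64)
}
```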

These are combined with the new LastKeyValueView. This is a simple key-value map that uses the _timestamp field as the expiration time. For any group-by tuple there will be at most one live entry in the map. Writes to state include a _generation field in the parquet files, which is used to ensure we restore the newest value; these semantics are sketched below.
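A minimal sketch of those semantics, assuming a plain in-memory map (the type, fields, and methods are invented for illustration, not the actual Arroyo implementation):

```rust
use std::collections::HashMap;
use std::hash::Hash;

struct Entry<V> {
    value: V,
    timestamp: u64,  // expiration time, taken from the row's _timestamp
    generation: u64, // monotonically increasing, persisted with each write
}

struct LastKeyValueSketch<K, V> {
    map: HashMap<K, Entry<V>>,
    next_generation: u64,
}

impl<K: Hash + Eq, V> LastKeyValueSketch<K, V> {
    // At most one live entry per group-by tuple: inserts overwrite.
    fn insert(&mut self, key: K, value: V, timestamp: u64) {
        let generation = self.next_generation;
        self.next_generation += 1;
        self.map.insert(key, Entry { value, timestamp, generation });
    }

    // On restore, the generation ensures only the newest write survives.
    fn restore(&mut self, key: K, entry: Entry<V>) {
        match self.map.get(&key) {
            Some(existing) if existing.generation >= entry.generation => {}
            _ => {
                self.next_generation = self.next_generation.max(entry.generation + 1);
                self.map.insert(key, entry);
            }
        }
    }

    // Entries whose _timestamp is at or before the watermark expire.
    fn expire(&mut self, watermark: u64) {
        self.map.retain(|_, e| e.timestamp > watermark);
    }
}
```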

In the operator, data is fed into the partial exec until it is time to flush, which happens under any of the following conditions (see the sketch after the list):

  • A 1 second tick has passed.
  • A checkpoint is received.
  • The watermark has advanced such that there is data in the backing tables that is ready to be expired (this is necessary because there may be fresh data that should keep that key alive).
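A minimal sketch of that trigger, with hypothetical state passed in as arguments:

```rust
use std::time::{Duration, Instant};

// Flush on the one-second tick, on a checkpoint, or once the watermark has
// passed the earliest expiration timestamp held in the backing tables.
// All parameter names here are assumptions for the sketch.
fn should_flush(
    last_flush: Instant,
    checkpoint_received: bool,
    watermark: Option<u64>,
    earliest_expiration_in_state: Option<u64>,
) -> bool {
    last_flush.elapsed() >= Duration::from_secs(1)
        || checkpoint_received
        || matches!(
            (watermark, earliest_expiration_in_state),
            (Some(w), Some(e)) if e <= w
        )
}
```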

Flushing proceeds through the following steps (a toy walkthrough in code follows the list):

  1. Close the active sender for computing partials. If there isn't one, there's no work to be done, so just exit.
  2. Compute the new partial data that has been received since the last flush.
  3. Look in the store of partial data for entries with the same key set as the new partials. If there aren't any, skip to step 5.
  4. Feed the data from steps 2 and 3 to the combine exec, then spool out its output, writing it to state and keeping it as the input to the final step.
  5. For the final result, first check whether the final table ("f") has any matches; these become retracts. Then write the new data to the final table. The retracts are emitted before the appends.
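A toy, synchronous walkthrough of those steps, with the DataFusion execs replaced by a simple count aggregate and the state tables stubbed as in-memory maps (every name is hypothetical; this only illustrates the data flow, not the real operator):

```rust
use std::collections::HashMap;

// Returns (retracts, appends); retracts are emitted before appends.
fn flush(
    // Steps 1–2: the partials drained from the partial exec since the last
    // flush. An empty map plays the role of "no active sender": no work.
    new_partials: HashMap<String, i64>,
    // Backing state tables, stubbed as in-memory maps.
    partial_table: &mut HashMap<String, i64>,
    final_table: &mut HashMap<String, i64>,
) -> (Vec<(String, i64)>, Vec<(String, i64)>) {
    let mut retracts = Vec::new();
    let mut appends = Vec::new();
    for (key, new_partial) in new_partials {
        // Steps 3–4: combine with any stored partial for the same key
        // (skipped when there is none) and write the result back to state.
        let combined = match partial_table.get(&key) {
            Some(stored) => stored + new_partial,
            None => new_partial,
        };
        partial_table.insert(key.clone(), combined);
        // Step 5: an existing row in the final table becomes a retract,
        // then the new value is written and emitted as an append.
        if let Some(old) = final_table.get(&key) {
            retracts.push((key.clone(), *old));
        }
        final_table.insert(key.clone(), combined);
        appends.push((key, combined));
    }
    (retracts, appends)
}
```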

To make progress between flushes, the partial exec is advanced. We panic in handle_future_result() because the input will never have been closed on that exec, so its future completing is unexpected.

Some other things that were changed in this PR:

  • Reworked how we create the sort projections for Joins. The previous approach ran into corner cases that surfaced while updating the smoketests.
  • Removed the `single_distinct_aggregation_to_group_by` optimizer rule, which causes COUNT(*) to become a nested aggregate.

@jacksonrnewhouse jacksonrnewhouse changed the base branch from df_debezium to master April 22, 2024 17:32
@jacksonrnewhouse jacksonrnewhouse enabled auto-merge (squash) April 22, 2024 17:32
@jacksonrnewhouse jacksonrnewhouse merged commit 74d0097 into master Apr 22, 2024
6 checks passed