Incremental CAgg Refresh Policy #7790

Merged

Conversation

@fabriziomello fabriziomello commented Mar 4, 2025

Currently, a Continuous Aggregate refresh policy processes the entire refresh window in a single pass, regardless of how large that window is. For example, if you have a hypertable with a huge number of rows, refreshing a CAgg can take a long time and consume a lot of CPU, memory, and I/O, and the aggregated data only becomes visible to users once the refresh policy finishes its execution.

This PR adds the capability for a CAgg refresh policy to be executed incrementally in "batches". Each "batch" is an individual transaction that processes a small fraction of the entire refresh window; as soon as a batch finishes, the refreshed data is already visible to users, even before the policy execution ends.

To tune and control the incremental refresh, new options were added to the add_continuous_aggregate_policy API (see the example after the list):

  • buckets_per_batch: number of buckets to be refreshed per "batch". In short, this value is multiplied by the CAgg bucket width to determine the size of each batch range. The default value is 0 (zero), which keeps the current single-batch behavior. Values less than 0 (zero) are not allowed.
  • max_batches_per_execution: maximum number of batches to process in a single policy execution. This option limits the number of batches per execution; any remaining batches are processed the next time the policy runs. The default value is 10 (ten), meaning each job execution processes at most ten batches. Set it to 0 (zero) for no limit. Values less than 0 (zero) are not allowed.
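
As an illustration, here is a minimal sketch of a policy using these options; the CAgg name, offsets, and schedule interval are hypothetical, and only the two new parameters come from this PR:

```sql
-- Hypothetical CAgg "metrics_hourly" with a 1 hour bucket width:
-- each batch covers 12 buckets (12 hours of data), and a single policy run
-- processes at most 5 batches; leftover batches are picked up on the next run.
SELECT add_continuous_aggregate_policy('metrics_hourly',
       start_offset => INTERVAL '30 days',
       end_offset => INTERVAL '1 hour',
       schedule_interval => INTERVAL '1 hour',
       buckets_per_batch => 12,
       max_batches_per_execution => 5);
```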

codecov bot commented Mar 4, 2025

Codecov Report

Attention: Patch coverage is 82.60870% with 32 lines in your changes missing coverage. Please review.

Project coverage is 81.89%. Comparing base (59f50f2) to head (a1d1109).
Report is 811 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| tsl/src/continuous_aggs/refresh.c | 79.13% | 9 Missing and 15 partials ⚠️ |
| tsl/src/bgw_policy/continuous_aggregate_api.c | 84.21% | 2 Missing and 1 partial ⚠️ |
| tsl/src/bgw_policy/job.c | 89.65% | 0 Missing and 3 partials ⚠️ |
| src/ts_catalog/continuous_agg.c | 80.00% | 0 Missing and 2 partials ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #7790      +/-   ##
==========================================
+ Coverage   80.06%   81.89%   +1.82%     
==========================================
  Files         190      247      +57     
  Lines       37181    45685    +8504     
  Branches     9450    11431    +1981     
==========================================
+ Hits        29770    37412    +7642     
- Misses       2997     3776     +779     
- Partials     4414     4497      +83     

@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch 10 times, most recently from 81f49e3 to b697dae on March 5, 2025 22:40
@fabriziomello fabriziomello added this to the v2.19.0 milestone Mar 5, 2025

@mkindahl mkindahl left a comment

Some minor comments. Since it is in draft, I will hold off on approving until you have the final version.

@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch from 4e90f1e to d976191 on March 6, 2025 23:59
@fabriziomello fabriziomello marked this pull request as ready for review March 7, 2025 00:14

@mkindahl mkindahl left a comment

A few questions about some parts of the code where I am not sure whether they are correct.

@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch from d865281 to b139fa1 on March 7, 2025 22:26
@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch from 3ee29cd to 8dcbe0e on March 10, 2025 13:38
@gayyappan

  • max_batches_per_execution: maximum number of batches to process in a single policy execution. This option limits the number of batches per execution; any remaining batches are processed the next time the policy runs. The default value is 10 (ten), meaning each job execution processes at most ten batches. Values less than 0 (zero) are not allowed.
    Why not let the default behavior process all batches? That behavior is more intuitive than restricting every policy run to a predefined number of batches.

@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch from 78be03a to 3eb84bd on March 10, 2025 15:32

fabriziomello commented Mar 10, 2025

  • max_batches_per_execution: maximum number of batches to process in a single policy execution. This option limits the number of batches per execution; any remaining batches are processed the next time the policy runs. The default value is 10 (ten), meaning each job execution processes at most ten batches. Values less than 0 (zero) are not allowed.
    Why not let the default behavior process all batches? That behavior is more intuitive than restricting every policy run to a predefined number of batches.

The default behavior will process all batches; this option only matters when buckets_per_batch > 0. The idea of this advanced configuration is to reduce resource consumption spikes during the refresh, which is why we chose a default limit of 10 batches per execution. It can be made unlimited by setting it to 0 (zero), but since we don't yet know what a good value is, we are being conservative: when buckets_per_batch is set to a value greater than zero, the maximum number of batches processed per execution is explicitly capped by default.
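
For instance, a sketch of the unlimited-batch configuration described above (the CAgg name and intervals are hypothetical):

```sql
-- Hypothetical example: incremental refresh in batches of 12 buckets, with no
-- cap on the number of batches per execution (max_batches_per_execution => 0).
SELECT add_continuous_aggregate_policy('metrics_hourly',
       start_offset => INTERVAL '30 days',
       end_offset => INTERVAL '1 hour',
       schedule_interval => INTERVAL '1 hour',
       buckets_per_batch => 12,
       max_batches_per_execution => 0);
```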

@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch from a9c1954 to 9cf7694 on March 10, 2025 16:46
@fabriziomello fabriziomello force-pushed the cagg_refresh_policy_incremental branch from e170e39 to a1d1109 on March 10, 2025 17:07
@fabriziomello fabriziomello merged commit 627c36f into timescale:main Mar 10, 2025
48 of 50 checks passed
fabriziomello added a commit to timescale/docs that referenced this pull request Mar 10, 2025
Related to this PR: timescale/timescaledb#7790

Signed-off-by: Fabrízio de Royes Mello <fabriziomello@gmail.com>
fabriziomello added a commit to timescale/docs that referenced this pull request Mar 12, 2025
Related to this PR: timescale/timescaledb#7790

Signed-off-by: Fabrízio de Royes Mello <fabriziomello@gmail.com>
philkra added a commit that referenced this pull request Mar 12, 2025
## 2.19.0 (2025-03-12)

This release contains performance improvements and bug fixes since 
the 2.18.2 release. We recommend that you upgrade at the next 
available opportunity.

**Features**
* [#7586](#7586) Vectorized aggregation with grouping by a single text column.
* [#7632](#7632) Optimize recompression for chunks without segmentby
* [#7655](#7655) Support vectorized aggregation on Hypercore TAM
* [#7669](#7669) Add support for merging compressed chunks
* [#7701](#7701) Implement a custom compression algorithm for bool columns. It is experimental and can undergo backwards-incompatible changes. For testing, enable it using timescaledb.enable_bool_compression = on.
* [#7707](#7707) Support ALTER COLUMN SET NOT NULL on compressed chunks
* [#7765](#7765) Allow tsdb as alias for timescaledb in WITH and SET clauses
* [#7786](#7786) Show warning for inefficient compress_chunk_time_interval configuration
* [#7788](#7788) Add callback to mem_guard for background workers
* [#7789](#7789) Do not recompress segmentwise when default order by is empty
* [#7790](#7790) Add configurable Incremental CAgg Refresh Policy

**Bugfixes**
* [#7665](#7665) Block merging of frozen chunks
* [#7673](#7673) Don't abort additional INSERTs when hitting first conflict
* [#7714](#7714) Fix a wrong result when compressed NULL values were confused with default values. This happened in very specific circumstances where ALTER TABLE added a new column with a default value, followed by an update and compression in a particular order.
* [#7747](#7747) Block TAM rewrites with incompatible GUC setting
* [#7748](#7748) Crash in the segmentwise recompression
* [#7764](#7764) Fix compression settings handling in Hypercore TAM
* [#7768](#7768) Remove costing index scan of hypertable parent
* [#7799](#7799) Handle DEFAULT table access name in ALTER TABLE

**Thanks**
* @bjornuppeke for reporting a problem with INSERT INTO ... ON CONFLICT DO NOTHING on compressed chunks
* @kav23alex for reporting a segmentation fault on ALTER TABLE with DEFAULT

Signed-off-by: Philip Krauss <35487337+philkra@users.noreply.github.com>
philkra added a commit that referenced this pull request Mar 18, 2025
## 2.19.0 (2025-03-18)

This release contains performance improvements and bug fixes since the
2.18.2 release. We recommend that you upgrade at the next available
opportunity.

* Improved concurrency of INSERT, UPDATE and DELETE operations on the
columnstore by no longer blocking DML statements during the
recompression of a chunk.
* Improved system performance during Continuous Aggregates refreshes by
breaking them into smaller batches, which reduces system pressure and
minimizes the risk of spilling to disk.
* Faster and more up-to-date results for queries against Continuous
Aggregates by materializing the most recent data first (vs old data
first in prior versions).
* Faster analytical queries with SIMD vectorization of aggregations over
text columns and group by over multiple columns
* Enable optimizing chunk size for faster query performance on the
columnstore by adding support for merging columnstore chunks to the
merge_chunk API.

**Deprecation warning**

This is the last minor release supporting PostgreSQL 14. Starting with
the next minor version of TimescaleDB, only PostgreSQL 15, 16 and 17
will be supported.

**Downgrading of 2.19.0**

This release introduces custom bool compression. If you enable this
feature via the `enable_bool_compression` GUC and must downgrade to a
previous version, please use the [following
script](https://github.com/timescale/timescaledb-extras/blob/master/utils/2.19.0-downgrade_new_compression_algorithms.sql)
to convert the columns back to their previous state. TimescaleDB
versions prior to 2.19.0 do not know how to handle this new type.

**Features**
* [#7586](#7586) Vectorized
aggregation with grouping by a single text column.
* [#7632](#7632) Optimize
recompression for chunks without segmentby
* [#7655](#7655) Support
vectorized aggregation on Hypercore TAM
* [#7669](#7669) Add
support for merging compressed chunks
* [#7701](#7701) Implement
a custom compression algorithm for bool columns. It is experimental and
can undergo backwards-incompatible changes. For testing, enable it using
timescaledb.enable_bool_compression = on.
* [#7707](#7707) Support
ALTER COLUMN SET NOT NULL on compressed chunks
* [#7765](#7765) Allow tsdb
as alias for timescaledb in WITH and SET clauses
* [#7786](#7786) Show
warning for inefficient compress_chunk_time_interval configuration
* [#7788](#7788) Add
callback to mem_guard for background workers
* [#7789](#7789) Do not
recompress segmentwise when default order by is empty
* [#7790](#7790) Add
configurable Incremental CAgg Refresh Policy

**Bugfixes**
* [#7665](#7665) Block
merging of frozen chunks
* [#7673](#7673) Don't
abort additional INSERTs when hitting first conflict
* [#7714](#7714) Fix a
wrong result when compressed NULL values were confused with default
values. This happened in very specific circumstances where ALTER TABLE
added a new column with a default value, followed by an update and
compression in a particular order.
* [#7747](#7747) Block TAM
rewrites with incompatible GUC setting
* [#7748](#7748) Crash in
the segmentwise recompression
* [#7764](#7764) Fix
compression settings handling in Hypercore TAM
* [#7768](#7768) Remove
costing index scan of hypertable parent
* [#7799](#7799) Handle
DEFAULT table access name in ALTER TABLE

**GUCs**
* `enable_bool_compression`: enable the BOOL compression algorithm,
default: `OFF`
* `enable_exclusive_locking_recompression`: enable exclusive locking
during recompression (legacy mode), default: `OFF`

**Thanks**
* @bjornuppeke for reporting a problem with INSERT INTO ... ON CONFLICT
DO NOTHING on compressed chunks
* @kav23alex for reporting a segmentation fault on ALTER TABLE with
DEFAULT

---------

Signed-off-by: Philip Krauss <35487337+philkra@users.noreply.github.com>
Signed-off-by: Ramon Guiu <ramon@timescale.com>
Co-authored-by: Ramon Guiu <ramon@timescale.com>