Use long windows when catching up #67

avelanarius · 2022-02-25T05:59:54Z

Changes the strategy the CDC library uses to generate the windows it reads from the CDC log. Previously, when the library started, it read windows of size queryTimeWindowSize starting from |now - ttl|. The problem with that approach is that catching up requires a large number of windows, for example: 24 hours / 30 seconds * 256 * 8 = 5898240 for an 8-shard Scylla server.

The new approach instead creates a single large window from |now - ttl| to |now - confidence window|.
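For reference, the window count quoted above can be reproduced with a few lines of Java. This is only a sketch of the arithmetic: the class and method names are hypothetical, and the description does not say what the factor 256 stands for, so it is passed in as an opaque multiplier.

```java
import java.time.Duration;

// Hypothetical helper that reproduces the arithmetic from the description.
// It is not part of the CDC library.
final class CatchUpWindowCount {
    // multiplier is the factor 256 from the description; its meaning is not
    // stated there, so it is treated as an opaque per-shard multiplier.
    static long smallWindowCount(Duration ttl, Duration queryTimeWindowSize,
                                 long multiplier, long shards) {
        long windowsPerStream = ttl.toMillis() / queryTimeWindowSize.toMillis();
        return windowsPerStream * multiplier * shards;
    }

    public static void main(String[] args) {
        long count = smallWindowCount(Duration.ofHours(24), Duration.ofSeconds(30), 256, 8);
        System.out.println(count); // prints 5898240
    }
}
```

With a single long catch-up window, the first factor (24 hours / 30 seconds = 2880) drops to 1.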

haaawk · 2022-02-28T11:28:16Z

scylla-cdc-base/src/main/java/com/scylladb/cdc/model/worker/TaskState.java

+    public static TaskState createInitialFor(GenerationId generation, Timestamp now,
+                                             long confidenceWindowSizeMs, long queryTimeWindowSizeMs) {
+        // Start reading at generation start:
+        Timestamp windowStart = generation.getGenerationStart();


What was the conclusion of your tests @avelanarius? Does it make sense to take max(generation.getGenerationStart(), now - CDC ttl)?

avelanarius · 2022-03-07

Yes, it made sense. The test was as follows: create a table with a small TTL (3 seconds), insert 10^6 rows, then run two types of SELECT: one over a small window and one over a large window. Both queries returned no rows (because of the 3-second TTL), but the small-window SELECT was faster. The large-window SELECT sometimes even timed out, despite returning no rows. We hypothesized that this is due to replicas sending "tombstones" to the coordinator.
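A minimal sketch of the clamping asked about above, max(generation.getGenerationStart(), now - CDC ttl). It uses plain java.time types instead of the library's Timestamp and GenerationId, and the class and method names are hypothetical, so treat it as an illustration rather than the code in this PR.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical sketch: picks the later of the generation start and the
// oldest point in time whose CDC rows could still be alive (now - ttl).
final class InitialWindowStart {
    static Instant initialWindowStart(Instant generationStart, Instant now, Duration cdcTtl) {
        Instant oldestLiveRow = now.minus(cdcTtl);
        // Rows older than now - ttl have expired, so starting earlier only
        // scans a range that can no longer return live rows.
        return generationStart.isAfter(oldestLiveRow) ? generationStart : oldestLiveRow;
    }
}
```

For an old generation this skips the expired range that the test above found to be slow to query; for a generation younger than the TTL it leaves the start unchanged.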

haaawk · 2022-02-28T11:30:06Z

scylla-cdc-base/src/main/java/com/scylladb/cdc/model/worker/TaskState.java

+        // queryTimeWindowSizeMs large (the consumer might need to wait a bit
+        // for the window to be ready for reading).
+        Timestamp windowEnd = now.plus(-confidenceWindowSizeMs, ChronoUnit.MILLIS);
+        if (windowEnd.compareTo(windowStart) < 0) {


Is this enough? Wouldn't we want to adjust windowEnd also when windowEnd - windowStart < queryTimeWindowSizeMs?

avelanarius · 2022-03-07

Maybe, but this is only the initial window, so it doesn't really matter.

I could add this at the cost of additional complexity.
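To make the two conditions concrete, here is a sketch with plain millisecond values. The body of the `if` is not shown in the diff excerpt, so the fallback used here (a window exactly queryTimeWindowSizeMs long, following the code comment) is an assumption, and the class, the method, and the `enforceMinimumSize` flag are hypothetical.

```java
// Hypothetical sketch; not the library's TaskState code.
final class InitialWindowEnd {
    static long compute(long windowStartMs, long nowMs, long confidenceWindowSizeMs,
                        long queryTimeWindowSizeMs, boolean enforceMinimumSize) {
        long windowEndMs = nowMs - confidenceWindowSizeMs;
        // Condition from the diff: the end would fall before the start.
        boolean tooEarly = windowEndMs < windowStartMs;
        // Condition suggested in review: the window would be shorter than
        // queryTimeWindowSizeMs. It also covers the tooEarly case.
        boolean tooShort = windowEndMs - windowStartMs < queryTimeWindowSizeMs;
        if (tooEarly || (enforceMinimumSize && tooShort)) {
            // Assumed fallback: make the window queryTimeWindowSizeMs large.
            windowEndMs = windowStartMs + queryTimeWindowSizeMs;
        }
        return windowEndMs;
    }
}
```

The two variants differ only when 0 <= windowEnd - windowStart < queryTimeWindowSizeMs, that is, for a short initial window.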

haaawk · 2022-02-28T11:32:01Z

scylla-cdc-base/src/main/java/com/scylladb/cdc/model/worker/TaskState.java


-        return this;
+        // Trim the start of the window with minimumWindowStart.
+        Timestamp newWindowStart = windowStart;


Why wasn't this needed before but is needed now? Or was it needed all along, and not trimming the start was a bug?

avelanarius · 2022-03-07

Previously it handled only one case:

---- WINDOW --- | TTL -> ---- WINDOW ---

Now it also handles this case:

---- WINDOW --- | TTL -> - WINDOW - | TTL

Previously, the second type of trimming didn't really matter, because the windows were very small (30 seconds by default).
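One possible reading of the two cases, sketched with plain millisecond values. The names are hypothetical, and both the behavior for a fully expired window (restart at minimumWindowStart with a queryTimeWindowSizeMs long window) and the mapping of the two diagrams onto the two branches are assumptions, not taken from the diff.

```java
// Hypothetical sketch of trimming a window against minimumWindowStart.
final class WindowTrim {
    // Returns {newStartMs, newEndMs}.
    static long[] trim(long startMs, long endMs, long minimumWindowStartMs,
                       long queryTimeWindowSizeMs) {
        if (endMs <= minimumWindowStartMs) {
            // The whole window is older than the cutoff: skip ahead
            // (assumed behavior for the first case).
            return new long[] {minimumWindowStartMs,
                               minimumWindowStartMs + queryTimeWindowSizeMs};
        }
        // The cutoff falls inside the window (or before it): keep the end,
        // trim only the start (assumed behavior for the second case).
        return new long[] {Math.max(startMs, minimumWindowStartMs), endMs};
    }
}
```

With 30-second windows the second branch saves at most 30 seconds of scanning, which matches the remark that it did not matter before; with a catch-up window spanning hours it can skip most of the range.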

haaawk

LGTM in general

avelanarius · 2022-03-10T12:38:36Z

v2: fixed a flaky unit test (off-by-one error).

racevedoo · 2022-05-16T13:12:38Z

Any news here? I'm very interested in this PR.

haaawk reviewed Feb 28, 2022

avelanarius force-pushed the long-windows branch from 1b7f0ba to ec66dab Compare March 10, 2022 12:37

avelanarius force-pushed the master branch from d731273 to 786893c Compare November 23, 2022 15:08

dkropachev force-pushed the master branch from db0183a to 7c7b9a1 Compare April 9, 2025 16:02

dkropachev force-pushed the master branch 4 times, most recently from 012c0c8 to c89661c Compare June 17, 2025 10:45

dkropachev force-pushed the master branch 16 times, most recently from 396912a to 78c0d0b Compare June 21, 2025 03:27

dkropachev force-pushed the master branch 13 times, most recently from 3c4bf55 to 3318b32 Compare June 23, 2025 17:23

dkropachev force-pushed the master branch 8 times, most recently from 5031da9 to b5730b8 Compare August 2, 2025 13:14
