Add capabilities to ingest from another stream without disabling the realtime table #9289

Merged: sajjad-moradi merged 5 commits into apache:master from sajjad-moradi:feature/resume.with.offset.criteria on Aug 30, 2022
Conversation

@sajjad-moradi (Contributor):

Description

With this PR, an operator can change the underlying stream without disabling the table. The operator needs to do the following:

  1. Issue a pause request to the table.
  2. Change the stream by modifying the table's stream configs (e.g., topic name, cluster name).
  3. Issue a resume request with the desired offset criteria.

When a stream is changed, partitions will have new offsets. Before this PR, a resume request could only resume consumption from the offsets at which the stream was paused. Since the offsets from the previous stream can't be used for the new stream, the resume request should specify an offset criteria for the starting point of consumption from the new stream.
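The three-step flow above can be sketched as URL construction against the controller's REST API. This is a minimal sketch: the controller address is a placeholder, and the `pauseConsumption` endpoint name is an assumption, while `resumeConsumption` and `consumeFrom` come from the discussion below.

```python
CONTROLLER = "http://localhost:9000"  # placeholder controller address (assumption)

def pause_url(table_name):
    # Step 1: pause consumption for the realtime table.
    # Endpoint name is an assumption based on this PR's pause/resume discussion.
    return f"{CONTROLLER}/tables/{table_name}/pauseConsumption"

def resume_url(table_name, consume_from=None):
    # Step 3: resume; consumeFrom optionally forces the offset criteria
    # (e.g. "smallest") when the underlying stream has been swapped.
    url = f"{CONTROLLER}/tables/{table_name}/resumeConsumption"
    if consume_from is not None:
        url += f"?consumeFrom={consume_from}"
    return url

# Step 2 (editing the stream configs in the table config) happens between the
# two requests, outside this sketch.
```

When no `consumeFrom` is passed, the resume request falls back to the default behavior of picking up from where consumption was paused.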

Testing Done

  • Modified LLCRealtimeClusterIntegrationTest locally and created two topics (the consumption started with the first topic)
  • Issued a pause request
  • Modified the table config to point to the second topic
  • Issued a resume request with 'smallest' as offset criteria
  • Verified that the records in the table include data from both topics
  • Verified the desired values in segment ZK metadata
  • Repeated the above steps with 'largest' offset criteria
  • Repeated the above steps with no offset criteria (for this case, the topic wasn't changed)

@codecov-commenter

codecov-commenter commented Aug 27, 2022

Codecov Report

Merging #9289 (c43ae70) into master (5ecca80) will increase coverage by 53.29%.
The diff coverage is 42.85%.

@@              Coverage Diff              @@
##             master    #9289       +/-   ##
=============================================
+ Coverage     15.28%   68.57%   +53.29%     
- Complexity      168     5007     +4839     
=============================================
  Files          1814     1867       +53     
  Lines         97379    99659     +2280     
  Branches      14893    15158      +265     
=============================================
+ Hits          14880    68345    +53465     
+ Misses        81367    26396    -54971     
- Partials       1132     4918     +3786     
Flag           Coverage Δ
integration1   26.24% <42.85%>  (?)
unittests1     67.08% <ø>       (?)
unittests2     15.28% <38.77%>  (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
...ller/api/resources/PinotRealtimeTableResource.java 0.00% <0.00%> (ø)
...r/validation/RealtimeSegmentValidationManager.java 74.32% <33.33%> (+30.66%) ⬆️
.../core/realtime/PinotLLCRealtimeSegmentManager.java 70.56% <47.61%> (+9.87%) ⬆️
...pinot/plugin/metrics/yammer/YammerJmxReporter.java 100.00% <0.00%> (ø)
...ugin/inputformat/csv/CSVRecordExtractorConfig.java 0.00% <0.00%> (ø)
.../pinot/server/api/resources/PinotServerLogger.java 0.00% <0.00%> (ø)
.../starter/helix/HelixInstanceDataManagerConfig.java 80.43% <0.00%> (ø)
...che/pinot/plugin/metrics/yammer/YammerMetered.java 25.00% <0.00%> (ø)
...plugin/segmentuploader/SegmentUploaderDefault.java 87.09% <0.00%> (ø)
.../helix/FreshnessBasedConsumptionStatusChecker.java 0.00% <0.00%> (ø)
... and 1383 more


Contributor:

Instead of offsetCriteria, can we call it consumeFrom? We can translate it to whatever we want internally.

And the possible values can be forceEarliest, forceLatest, or best (yes, we need a third value).

forceEarliest and forceLatest will ignore any previously completed segment offsets and just do as forced: pick the earliest or latest offsets and resume consumption.

The best option will attempt to minimize data loss by picking up the first available event after the last consumed event.

Contributor Author:

👍 on consumeFrom. It reads better that way, but I prefer to keep smallest and largest so the URI looks like:
/tables/<tableName>/resumeConsumption?consumeFrom=smallest

For the behavior of the best option you suggested, we currently achieve that if the consumeFrom parameter is not specified.

Contributor:

Let us add all three options, and say that the default is best. Also, I prefer earliest and latest as being more intuitive. Alternatives: oldestEvents, mostRecentEvents, fromLastPause.

Contributor Author:

Done.

@sajjad-moradi force-pushed the feature/resume.with.offset.criteria branch 2 times, most recently from 7d2cb81 to 757a3c3, on August 29, 2022 at 00:54
Contributor:

Why not just keep this as smallest/largest, since users are already familiar with those for table consumption?

Contributor:

Sajjad originally had offsetCriteria=smallest|largest. I was the one asking him to change, and we are now at consumeFrom=earliest|latest|best.

Since two of you prefer smallest/largest, I can go with that. I also felt that we should have a third (optional) value. If we settle on best, most people may choose it thinking it is some automatic mode; in reality, what we mean is that we will pick up from where we left off.

If you both feel that leaving out the third option is ideal, I can go with that as well.

Let us wrap this up today, thanks.

Contributor Author:

This is how I see the pause/resume feature: an operator pauses consumption for some reason, and after a while, they want to resume. The default behavior should be to pick up events from the offset where we left off. If that offset is gone, we should automatically start from the smallest available offset. That's the default behavior, but if the operator wants to change the offset for some reason, like a stream connection change, then consumption should resume based on the provided "resumeFrom" parameter. As Neha mentioned, since users/operators are familiar with the smallest/largest offset criteria, IMO it's better to use the same values.
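The default-vs-forced behavior described above can be sketched as a small selection function. This is illustrative only (the function and parameter names are not the PR's actual code):

```python
def select_resume_offset(last_offset, smallest, largest, consume_from=None):
    """Pick the offset to resume consumption from, per the behavior above."""
    # An explicit criteria in the resume request overrides everything.
    if consume_from == "smallest":
        return smallest
    if consume_from == "largest":
        return largest
    # Default: pick up where we left off; if that offset has aged out of the
    # stream's retention, fall back to the smallest available offset.
    if last_offset is not None and last_offset >= smallest:
        return last_offset
    return smallest
```

The key property is that omitting the parameter gives the safe "resume where we paused, else smallest" default, so the forced values only matter when the operator has deliberately changed the stream.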

Contributor:

+1 on keeping smallest/largest. IMO there's no need for "best" as that's the default anyway.

Contributor:

Should we also have the timestamp and period offset criteria, now that we support those?

Contributor:

These may not apply to the use cases that we are considering, but for completeness, we can have it (or, we can choose to add later)

Contributor Author:

The change is very simple, but I think we can add it later when there's a request for it.

Contributor @mcvsubbu left a comment:

The approach looks fine to me; the code is a bit confusing to read.

Overall, I am fine with this

Contributor:

Can you add a comment before this line explaining what this map is supposed to contain?

The logic in this class is getting quite hard to read. Can we move some methods into a base class and sub-class the partitionGroup vs partitionId handling for the two different types of streams we support?

Contributor Author:

I'll add the comments.

For the refactoring, I agree with you that this class is hard to read. It already has more than 1500 lines and it's worth refactoring, but the refactoring is outside the scope of this PR.

Contributor:

It may make things more readable if we get the smallest offset all the time. Does it involve multiple calls to the stream, and is that what we are optimizing here? If so, it's good to add a comment. Otherwise, getting it once unconditionally may make things a bit more readable.

Contributor Author:

Yes, this is for optimization. Some topics have hundreds of partitions, and we shouldn't call the stream to fetch the same metadata hundreds of times. I'll add the comments.
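The optimization being discussed can be illustrated with a simple memoized fetch: the stream's metadata is requested once per topic and reused for every partition. This is a sketch with hypothetical names, not the PR's actual code:

```python
def make_offset_fetcher(fetch_smallest_offsets):
    # Memoize per-topic metadata so a topic with hundreds of partitions
    # triggers one stream metadata call instead of one call per partition.
    cache = {}
    def get(topic):
        if topic not in cache:
            cache[topic] = fetch_smallest_offsets(topic)
        return cache[topic]
    return get
```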

Contributor:

Can you consider removing the offsetCriteria from the argument here, and incorporating the logic to deal with a non-null value of offsetCriteria outside this method? Not sure if it will make the logic more readable, but worth a try, I think

Contributor Author:

The selectStartOffset method is used in two places. If we move the non-null check out of this method, we'd need to repeat it in both places; that's why it was added to the method.
Also, this method selects the start offset: if the offset criteria is provided, it gets the start offset from one map, and if not, it gets it from the other map. That's another reason I think the non-null check belongs in this method.

@sajjad-moradi force-pushed the feature/resume.with.offset.criteria branch from c78ef83 to c43ae70 on August 29, 2022 at 19:50
Contributor @mcvsubbu left a comment:

Sajjad and I met offline and decided that it is best we stick to the well-known terms here.

I have no further comments, lgtm

4 participants