-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spark 3.4: Support rate limit in Spark Streaming #7422
Spark 3.4: Support rate limit in Spark Streaming #7422
Conversation
Output of :
spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkPositionDeltaOperation.java |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the forward port, looks good to me, just waiting for CI to complete.
spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkReadConf.java
Outdated
Show resolved
Hide resolved
spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkMicroBatchStream.java
Outdated
Show resolved
Hide resolved
// TODO : use readLimit provided in function param, the readLimits are derived from | ||
// these 2 properties. | ||
if ((curFilesAdded + 1) > maxFilesPerMicroBatch | ||
|| (curRecordCount + task.file().recordCount()) > maxRecordsPerMicroBatch) { | ||
shouldContinueReading = false; | ||
break; | ||
} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since this is a forward port this should be fine for now but probably worth creating an issue to track the ToDo to use the provided ReadLimit
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK let me take this as follow-up of this pr immediately and add a tracking issue as well meanwhile.
@singhpk234 could you rebase the PR based on the refactoring? |
6fe4dc6
to
076a5d7
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM as well, thanks @singhpk234 !
I think the comments are all addressed, will go ahead and merge it. Thanks everyone for the review! |
@singhpk234 @jackye1995 |
Yea this test fail for my pr run as well.. |
Added a pr : #7470 for the fix. |
Thanks @singhpk234. |
About the change
Forward port #4479 to spark 3.4
cc @jackye1995