-
Notifications
You must be signed in to change notification settings - Fork 28.6k
[SPARK-51172][SS] Rename to spark.sql.optimizer.pruneFiltersCanPruneStreamingSubplan #49897
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@@ -4110,7 +4110,7 @@ object SQLConf { | |||
.createWithDefault(ByteArrayMethods.MAX_ROUNDED_ARRAY_LENGTH) | |||
|
|||
val PRUNE_FILTERS_CAN_PRUNE_STREAMING_SUBPLAN = | |||
buildConf("spark.databricks.sql.optimizer.pruneFiltersCanPruneStreamingSubplan") | |||
buildConf("spark.sql.optimizer.pruneFiltersCanPruneStreamingSubplan") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice catch!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM. Thank you, @HyukjinKwon .
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you file a new JIRA issue for this because this should have a fix version |
uhoh |
sure, will do |
71a09c9
to
2e544ff
Compare
Hmmmm .. @HeartSaVioR WDYT? should we only make this change in master branch alone? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM again.
BTW, I made a new scalastyle rule to prevent this mistake because we missed this mistake completely at 3.5.4 and almost in 4.0.0, @HyukjinKwon .
Let's fix in master/4.0 first to avoid making more releases shipping this. We should probably think through how to not break 3.5 itself + upgrading from 3.5 to 4.0+ when fixing in 3.5. If we weren't released this in 3.5 that'd be ideal but... |
Merged to master and branch-4.0. |
…treamingSubplan ### What changes were proposed in this pull request? This PR is a followup of #48149 that proposes to rename `spark.databricks.sql.optimizer.pruneFiltersCanPruneStreamingSubplan` to `spark.sql.optimizer.pruneFiltersCanPruneStreamingSubplan` ### Why are the changes needed? For consistent naming. ### Does this PR introduce _any_ user-facing change? No, the main PR has not been released yet. ### How was this patch tested? CI should test it out. ### Was this patch authored or co-authored using generative AI tooling? No. Closes #49897 from HyukjinKwon/SPARK-49699. Authored-by: Hyukjin Kwon <gurwls223@apache.org> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org> (cherry picked from commit decb677) Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
### What changes were proposed in this pull request? This PR aims to add `configName` Scalastyle rule to prevent invalid config names. ### Why are the changes needed? To prevent repetitive mistake pattern - #45649 - #48149 - #49121 ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Currently, this PR will fail at Scalastyle test because the `master` branch is broken. We can merge this after the following PR. - #49897 ### Was this patch authored or co-authored using generative AI tooling? No. Closes #49900 from dongjoon-hyun/SPARK-51173. Authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
### What changes were proposed in this pull request? This PR aims to add `configName` Scalastyle rule to prevent invalid config names. ### Why are the changes needed? To prevent repetitive mistake pattern - #45649 - #48149 - #49121 ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Currently, this PR will fail at Scalastyle test because the `master` branch is broken. We can merge this after the following PR. - #49897 ### Was this patch authored or co-authored using generative AI tooling? No. Closes #49900 from dongjoon-hyun/SPARK-51173. Authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org> (cherry picked from commit 274dc5e) Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
There are two cases:
1> This will be written back to offset log to ensure this value to be kept on streaming query lifecycle. When they upgrade to the Spark version which we renamed the config, there is offset log to read the static config back, and there is no entry for This could break the query if the rule impacts the query, because the effectiveness of the fix is flipped. 2> When they further upgrade this query to the Spark version which we renamed the config, same thing applies and the fix is disabled. So no change. I feel like this is a sort of "dead end" because if we don't want to make the case 1 to break, the only way is to make an alias of config, which enforces us to keep this problematic config name much longer (probably forever, as technically they can jump from Spark 3.5.4 to any future version). The chance to hit this case is not very high, but when they encounter this, they'd need to drop checkpoint and start from the scratch, which is unpleasant. Not sure how we could inform Spark 3.5.4 users about this - maybe should we mention about this in release note for 3.5.5 / 4.0.0? Anyway, we should apply the same fix in Spark 3.5 branch line. I can't imagine there is other way around. |
…treamingSubplan This PR is a followup of apache#48149 that proposes to rename `spark.databricks.sql.optimizer.pruneFiltersCanPruneStreamingSubplan` to `spark.sql.optimizer.pruneFiltersCanPruneStreamingSubplan` For consistent naming. No, the main PR has not been released yet. CI should test it out. No. Closes apache#49897 from HyukjinKwon/SPARK-49699. Authored-by: Hyukjin Kwon <gurwls223@apache.org> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
Also a tiny fix if folks can merge |
This PR aims to add `configName` Scalastyle rule to prevent invalid config names. To prevent repetitive mistake pattern - #45649 - #48149 - #49121 No. Currently, this PR will fail at Scalastyle test because the `master` branch is broken. We can merge this after the following PR. - #49897 No. Closes #49900 from dongjoon-hyun/SPARK-51173. Authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org> (cherry picked from commit 274dc5e) Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
According to https://lists.apache.org/thread/25xct2zv31kh1plpy766crs3bd6go5s6, I'm -0.99 on this PR, and I'm willing to cast -1 on this PR anytime soon, because my comment remains unaddressed for more than 1 month from merging this PR.
I got thumbs up, so this is not something I only think this is critical. There are argument that "users can upgrade to Spark 3.5.5 before Spark 4.0.0", which I do not believe I agree with. Since both are casting -1 for each other's proposal, I think my comment is NOT addressed. I'm waiting for feedback in dev@, but if we fail to make consensus on the approach on migration, I really have to cast -1 and revert this PR in master/4.0. |
Here the meaning of "my comment is NOT addressed" is,
So we never have a way to address my question "We should probably think through how to not break 3.5 itself + upgrading from 3.5 to 4.0+ when fixing in 3.5." which got community's consensus. This is precondition of merging this PR and I hope we quickly address the issue, hence the migration logic came up, but... |
What changes were proposed in this pull request?
This PR is a followup of #48149 that proposes to rename
spark.databricks.sql.optimizer.pruneFiltersCanPruneStreamingSubplan
tospark.sql.optimizer.pruneFiltersCanPruneStreamingSubplan
Why are the changes needed?
For consistent naming.
Does this PR introduce any user-facing change?
No, the main PR has not been released yet.
How was this patch tested?
CI should test it out.
Was this patch authored or co-authored using generative AI tooling?
No.