[SPARK-14796][SQL] Add spark.sql.optimizer.inSetConversionThreshold config option.#12562
[SPARK-14796][SQL] Add spark.sql.optimizer.inSetConversionThreshold config option.#12562dongjoon-hyun wants to merge 2 commits intoapache:masterfrom dongjoon-hyun:SPARK-14796
Conversation
|
Can we add a unit test in the appropriate optimizer suite? We also need to come up with a better name. |
|
Thank you for review, @rxin . |
|
Test build #56492 has finished for PR 12562 at commit
|
|
Oh, sorry. There exists already |
|
Test build #56511 has finished for PR 12562 at commit
|
|
Test build #56514 has finished for PR 12562 at commit
|
|
Test build #56550 has finished for PR 12562 at commit
|
There was a problem hiding this comment.
I'd give this a more descriptive name, and explicitly say setting the threshold for turning into InSet
|
maybe inSetConversionThreshold? |
|
Thank you so much, @rxin and @marmbrus !
By the way, @marmbrus d you mean the duplication of value |
|
Test build #56573 has finished for PR 12562 at commit
|
|
Merging in master. Thanks. |
What changes were proposed in this pull request?
Currently,
OptimizeInoptimizer replacesInexpression intoInSetexpression if the size of set is greater than a constant, 10.This issue aims to make a configuration
spark.sql.optimizer.inSetConversionThresholdfor that.After this PR,
OptimizerInis configurable.How was this patch tested?
Pass the Jenkins tests (with a new testcase)