-
Notifications
You must be signed in to change notification settings - Fork 28.6k
[SPARK-5723][SQL]Change the default file format to Parquet for CTAS statements. #4639
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
… format/handler is specified.
Test build #27601 has started for PR 4639 at commit
|
Test build #27601 has finished for PR 4639 at commit
|
Test PASSed. |
Test build #27646 has started for PR 4639 at commit
|
}.getMessage | ||
assert( | ||
message.contains("Cannot specify database name in a CTAS statement"), | ||
"When spark.sql.sources.convertHiveCTASWithoutStorageSpec is true, we should not allow " + |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this message is out of date. Also, "should.." -> "do not currently allow the database name to be specified"
Test build #27646 has finished for PR 4639 at commit
|
Test PASSed. |
* and no SerDe is specified (no ROW FORMAT SERDE clause). | ||
*/ | ||
protected[sql] def convertCTAS: Boolean = | ||
getConf("spark.sql.hive.convertCTAS", "false") == "true" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: .toBoolean
Test build #27666 has started for PR 4639 at commit
|
Test build #27666 has finished for PR 4639 at commit
|
Test PASSed. |
…tatements. JIRA: https://issues.apache.org/jira/browse/SPARK-5723 Author: Yin Huai <yhuai@databricks.com> This patch had conflicts when merged, resolved by Committer: Michael Armbrust <michael@databricks.com> Closes #4639 from yhuai/defaultCTASFileFormat and squashes the following commits: a568137 [Yin Huai] Merge remote-tracking branch 'upstream/master' into defaultCTASFileFormat ad2b07d [Yin Huai] Update tests and error messages. 8af5b2a [Yin Huai] Update conf key and unit test. 5a67903 [Yin Huai] Use data source write path for Hive's CTAS statements when no storage format/handler is specified. (cherry picked from commit e50934f) Signed-off-by: Michael Armbrust <michael@databricks.com>
JIRA: https://issues.apache.org/jira/browse/SPARK-5723