[SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default #46207

dongjoon-hyun · 2024-04-24T16:59:19Z

What changes were proposed in this pull request?

This PR aims to switch spark.sql.legacy.createHiveTableByDefault to false by default in order to move away from this legacy behavior from Apache Spark 4.0.0 while the legacy functionality will be preserved during Apache Spark 4.x period by setting spark.sql.legacy.createHiveTableByDefault=true.

Why are the changes needed?

Historically, this behavior change was merged at Apache Spark 3.0.0 activity in SPARK-30098 and reverted officially during the 3.0.0 RC period.

At Apache Spark 3.1.0, we had another discussion and defined it as Legacy behavior via a new configuration by reusing the JIRA ID, SPARK-30098.

Last year, this was proposed again twice and Apache Spark 4.0.0 is a good time to make a decision for Apache Spark future direction.

SPARK-42603 on 2023-02-27 as an independent idea.
SPARK-46122 on 2023-11-27 as a part of Apache Spark 4.0.0 idea

Does this PR introduce any user-facing change?

Yes, the migration document is updated.

How was this patch tested?

Pass the CIs with the adjusted test cases.

Was this patch authored or co-authored using generative AI tooling?

No.

…by default

yaooqinn

LGTM

dongjoon-hyun · 2024-04-25T03:15:12Z

Thank you, @yaooqinn . I'll throw a discussion thread for this Tonight.

dongjoon-hyun · 2024-04-26T17:00:52Z

I started a vote for this PR too.

https://lists.apache.org/thread/x09gynt90v3hh5sql1gt9dlcn6m6699p

dongjoon-hyun · 2024-04-30T08:00:30Z

Hi, @cloud-fan , @yaooqinn , @ulysses-you . If you don't mind, could you participate the vote? :)

dongjoon-hyun · 2024-04-30T08:44:16Z

Thank you all. Votes passed.

https://lists.apache.org/thread/65h92lc4mp1d6l6f00xfnlh586for05g

dongjoon-hyun · 2024-04-30T08:45:03Z

Merged to master for Apache Spark 4.0.0.

dongjoon-hyun marked this pull request as draft April 24, 2024 16:59

dongjoon-hyun changed the title ~~[SPARK-46122][SQL] Disable spark.sql.legacy.createHiveTableByDefault by default~~ [SPARK-46122][SQL] Disable spark.sql.legacy.createHiveTableByDefault by default Apr 24, 2024

github-actions bot added the SQL label Apr 24, 2024

dongjoon-hyun changed the title ~~[SPARK-46122][SQL] Disable spark.sql.legacy.createHiveTableByDefault by default~~ [SPARK-46122][SQL] Set spark.sql.legacy.createHiveTableByDefault to false by default Apr 24, 2024

github-actions bot added the DOCS label Apr 24, 2024

dongjoon-hyun force-pushed the SPARK-46122 branch from a625607 to f20d414 Compare April 24, 2024 22:03

github-actions bot added the PYTHON label Apr 24, 2024

[SPARK-46122][SQL] Disable spark.sql.legacy.createHiveTableByDefault …

29d1be5

…by default

dongjoon-hyun force-pushed the SPARK-46122 branch from 0e02a29 to 29d1be5 Compare April 25, 2024 01:59

dongjoon-hyun marked this pull request as ready for review April 25, 2024 01:59

yaooqinn approved these changes Apr 25, 2024

View reviewed changes

cloud-fan approved these changes Apr 25, 2024

View reviewed changes

LuciferYang approved these changes Apr 26, 2024

View reviewed changes

ulysses-you approved these changes Apr 26, 2024

View reviewed changes

HyukjinKwon approved these changes Apr 28, 2024

View reviewed changes

dongjoon-hyun closed this in 9e8c4aa Apr 30, 2024

dongjoon-hyun deleted the SPARK-46122 branch April 30, 2024 08:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default #46207

[SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default #46207

Uh oh!

dongjoon-hyun commented Apr 24, 2024 •

edited

Loading

Uh oh!

yaooqinn left a comment

Uh oh!

dongjoon-hyun commented Apr 25, 2024

Uh oh!

dongjoon-hyun commented Apr 26, 2024

Uh oh!

dongjoon-hyun commented Apr 30, 2024

Uh oh!

dongjoon-hyun commented Apr 30, 2024

Uh oh!

dongjoon-hyun commented Apr 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-46122][SQL] Set spark.sql.legacy.createHiveTableByDefault to false by default #46207

[SPARK-46122][SQL] Set spark.sql.legacy.createHiveTableByDefault to false by default #46207

Uh oh!

Conversation

dongjoon-hyun commented Apr 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

yaooqinn left a comment

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun commented Apr 25, 2024

Uh oh!

dongjoon-hyun commented Apr 26, 2024

Uh oh!

dongjoon-hyun commented Apr 30, 2024

Uh oh!

dongjoon-hyun commented Apr 30, 2024

Uh oh!

dongjoon-hyun commented Apr 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default #46207

[SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default #46207

dongjoon-hyun commented Apr 24, 2024 •

edited

Loading