Skip to content

Conversation

@xieshuaihu
Copy link

What changes were proposed in this pull request?

Fix a bug in pyspark connect.

DataFrameWriterV2.overwritePartitions set mode as overwrite_partitions [pyspark/sql/connect/readwriter.py, line 825], but WirteOperationV2 take it as overwrite_partition [pyspark/sql/connect/plan.py, line 1660]

Why are the changes needed?

make dataframe.writeTo(table).overwritePartitions() work

Does this PR introduce any user-facing change?

No

How was this patch tested?

No test. This bug is very obvious.

Was this patch authored or co-authored using generative AI tooling?

No

@HyukjinKwon
Copy link
Member

@xieshuaihu xieshuaihu force-pushed the python_connect_overwrite branch from 6d4ae12 to b04efc5 Compare October 16, 2023 05:59
@HyukjinKwon
Copy link
Member

HyukjinKwon commented Oct 16, 2023

Merged to master, and branch-3.5.

HyukjinKwon pushed a commit that referenced this pull request Oct 16, 2023
Fix a bug in pyspark connect.

DataFrameWriterV2.overwritePartitions set mode as overwrite_partitions [pyspark/sql/connect/readwriter.py, line 825], but WirteOperationV2 take it as overwrite_partition [pyspark/sql/connect/plan.py, line 1660]

make dataframe.writeTo(table).overwritePartitions() work

No

No test. This bug is very obvious.

No

Closes #43367 from xieshuaihu/python_connect_overwrite.

Authored-by: xieshuaihu <xieshuaihu@agora.io>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 9bdad31)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
@xieshuaihu xieshuaihu deleted the python_connect_overwrite branch November 9, 2023 08:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants