Skip to content

Conversation

@asl3
Copy link
Contributor

@asl3 asl3 commented Dec 3, 2025

What changes were proposed in this pull request?

Skip doctest to restore pyspark-pandas CI, while docs are being updated for arrow by default

Why are the changes needed?

Restore the CI

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI

Was this patch authored or co-authored using generative AI tooling?

No

@asl3 asl3 changed the title [SPARK-54555][PYTHON][TESTS][FOLLOW-UP] Skip doctest [SPARK-54555][PYTHON][TESTS][FOLLOW-UP] Skip pyspark-pandas doctest Dec 3, 2025
@zhengruifeng
Copy link
Contributor

merged to master

zhengruifeng pushed a commit that referenced this pull request Dec 4, 2025
…ndlingMode` in pyspark pandas doctest

### What changes were proposed in this pull request?

After #53299, explicitly set conf `spark.sql.execution.pandas.structHandlingMode` to `row`. This is needed because when Arrow optimization was previously disabled, structHandlingMode converted to Row object by default, but when Arrow optimization is enabled, it converts to dict or raise an Exception if duplicated nested field names.

To match the docs behavior after enabling arrow by default, we explicitly set this conf to row.

### Why are the changes needed?

Fix pyspark-pandas doctest and remove the skip of doctests

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

CI running pyspark-pandas doctest

### Was this patch authored or co-authored using generative AI tooling?

No

Closes #53301 from asl3/pysparkpandasdoctest.

Authored-by: Amanda Liu <amanda.liu@databricks.com>
Signed-off-by: Ruifeng Zheng <ruifengz@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants