feat: intial hudi reg test #3641
base: main
Conversation
dimas-b
left a comment
Thanks for your contribution, @rahil-c ! I'm just wondering if using shell might be overkill for this test. Specific comment thread below.
"http://${POLARIS_HOST:-localhost}:8181/api/catalog/v1/config?warehouse=${CATALOG_NAME}"
echo
echo "Catalog created"
cat << EOF | ${SPARK_HOME}/bin/spark-sql -S \
Would it be possible to run this test as an integration test under JUnit inside the Gradle build?
Hi @dimas-b. The purpose of this regression test is to validate the end-to-end user experience when using Spark with both --packages and --jars. This is an important scenario that cannot be fully covered by integration tests.
While this test is relatively expensive, that is exactly why it includes only very basic test cases. More complex scenarios and edge cases are covered by integration tests, which provide a more cost-effective approach.
# Define test suites to run
# Each suite specifies: test_file:table_format:test_shortname
declare -a TEST_SUITES=(
  "spark_sql.sh:delta:spark_sql"
How about we enforce a test file naming format like xxx_<table_format>.sh, and keep all test source files and reference files in a separate folder? Then we only need to list the folder to get all test files, and we can extract the table format by parsing each file name. The benefits: it's easy to onboard new tests, and developers don't have to type a long string when running a single test (just the file name).
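A minimal sketch of that suggestion, assuming a naming pattern of <test_shortname>_<table_format>.sh (the file names below are hypothetical, not taken from this PR; a real run would glob a tests/ folder instead of a hard-coded list):

```shell
#!/usr/bin/env bash
# Discover test files and derive the table format from the file name.
# In practice this loop would be: for test_file in tests/*.sh; do ...
for test_file in spark_sql_delta.sh spark_sql_hudi.sh; do
  base="${test_file%.sh}"        # strip the .sh extension
  table_format="${base##*_}"     # text after the last underscore
  test_shortname="${base%_*}"    # text before the last underscore
  echo "${test_shortname}:${table_format}"
done
```

With this layout, running a single test only requires the file name; the table format no longer has to be spelled out in a long suite string.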
# this is mostly useful for building the Docker image with all needed dependencies
${SPARK_HOME}/bin/spark-sql -e "SELECT 1"
if [[ "$TABLE_FORMAT" == "hudi" ]]; then
  # For Hudi: Pass --packages on command line to match official Hudi docs approach
I don't think we need the if/else here anymore.
rm -rf /tmp/spark_hudi_catalog/

curl -i -X DELETE -H "Authorization: Bearer ${SPARK_BEARER_TOKEN}" -H 'Accept: application/json' -H 'Content-Type: application/json' \
  http://${POLARIS_HOST:-localhost}:8181/api/management/v1/catalogs/${CATALOG_NAME} > /dev/stderr
(No newline at end of file)
nit: add a newline at the end of the file.
Summary
cc @gh-yzou @singhpk234 @flyrain
Checklist
CHANGELOG.md (if needed)
site/content/in-dev/unreleased (if needed)