[SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function #37911

sadikovi · 2022-09-16T06:49:28Z

What changes were proposed in this pull request?

This is a follow-up for #37833.

The PR fixes column names in arrays_zip function for the cases when GetArrayStructFields and GetMapValue expressions are used (see unit tests for more details).

Before the patch, the column names would be indexes or an AnalysisException would be thrown in the case of GetArrayStructFields example.

Why are the changes needed?

Fixes an inconsistency issue in Spark 3.2 and onwards where the fields would be labeled as indexes instead of column names.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

I added unit tests that reproduce the issue and confirmed that the patch fixes them.

AmplabJenkins · 2022-09-16T12:55:54Z

Can one of the admins verify this patch?

HyukjinKwon · 2022-09-16T13:04:37Z

Merged to master, branch-3.3 and branch-3.2.

…rays_zip" function ### What changes were proposed in this pull request? This is a follow-up for #37833. The PR fixes column names in `arrays_zip` function for the cases when `GetArrayStructFields` and `GetMapValue` expressions are used (see unit tests for more details). Before the patch, the column names would be indexes or an AnalysisException would be thrown in the case of `GetArrayStructFields` example. ### Why are the changes needed? Fixes an inconsistency issue in Spark 3.2 and onwards where the fields would be labeled as indexes instead of column names. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? I added unit tests that reproduce the issue and confirmed that the patch fixes them. Closes #37911 from sadikovi/SPARK-40470. Authored-by: Ivan Sadikov <ivan.sadikov@databricks.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org> (cherry picked from commit 9b0f979) Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>

sadikovi · 2022-09-18T23:05:27Z

Thank you, @HyukjinKwon!

…rays_zip" function ### What changes were proposed in this pull request? This is a follow-up for apache#37833. The PR fixes column names in `arrays_zip` function for the cases when `GetArrayStructFields` and `GetMapValue` expressions are used (see unit tests for more details). Before the patch, the column names would be indexes or an AnalysisException would be thrown in the case of `GetArrayStructFields` example. ### Why are the changes needed? Fixes an inconsistency issue in Spark 3.2 and onwards where the fields would be labeled as indexes instead of column names. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? I added unit tests that reproduce the issue and confirmed that the patch fixes them. Closes apache#37911 from sadikovi/SPARK-40470. Authored-by: Ivan Sadikov <ivan.sadikov@databricks.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>

…rays_zip" function ### What changes were proposed in this pull request? This is a follow-up for apache#37833. The PR fixes column names in `arrays_zip` function for the cases when `GetArrayStructFields` and `GetMapValue` expressions are used (see unit tests for more details). Before the patch, the column names would be indexes or an AnalysisException would be thrown in the case of `GetArrayStructFields` example. ### Why are the changes needed? Fixes an inconsistency issue in Spark 3.2 and onwards where the fields would be labeled as indexes instead of column names. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? I added unit tests that reproduce the issue and confirmed that the patch fixes them. Closes apache#37911 from sadikovi/SPARK-40470. Authored-by: Ivan Sadikov <ivan.sadikov@databricks.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org> (cherry picked from commit 9b0f979) Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>

fix map and array

4d42322

github-actions bot added the SQL label Sep 16, 2022

HyukjinKwon approved these changes Sep 16, 2022

View reviewed changes

HyukjinKwon changed the title ~~[SPARK-40470] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function~~ [SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function Sep 16, 2022

HyukjinKwon closed this in 9b0f979 Sep 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function #37911

[SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function #37911

sadikovi commented Sep 16, 2022

AmplabJenkins commented Sep 16, 2022

HyukjinKwon commented Sep 16, 2022 •

edited

Loading

sadikovi commented Sep 18, 2022

[SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function #37911

[SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function #37911

Conversation

sadikovi commented Sep 16, 2022

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

AmplabJenkins commented Sep 16, 2022

HyukjinKwon commented Sep 16, 2022 • edited Loading

sadikovi commented Sep 18, 2022

HyukjinKwon commented Sep 16, 2022 •

edited

Loading