[SPARK-6747] [SQL] Throw an AnalysisException when unsupported Java list types used in Hive UDF #7248
maropu wants to merge 19 commits into apache:master
Conversation
@marmbrus Through the discussion in #5395, I think it is hard to support Java List<> types in Spark SQL because of type erasure. ISTM that if UDF developers need this type, they should use the GenericUDF interface instead of the UDF one. So, I re-created a PR to throw a meaningful exception when such types are used. Any thoughts?
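For context, a minimal sketch in Scala of the GenericUDF alternative mentioned above; the class name, the fixed return value, and the ObjectInspector choices are illustrative, not part of this PR:

// Sketch of a GenericUDF returning list<string>. Unlike a simple UDF,
// the return type is declared via an ObjectInspector at initialization
// time rather than recovered by reflection, so JVM type erasure is not
// a problem.
import org.apache.hadoop.hive.ql.udf.generic.GenericUDF
import org.apache.hadoop.hive.serde2.objectinspector.{ObjectInspector, ObjectInspectorFactory}
import org.apache.hadoop.hive.serde2.objectinspector.primitive.PrimitiveObjectInspectorFactory

class GenericUDFToListString extends GenericUDF {
  override def initialize(args: Array[ObjectInspector]): ObjectInspector =
    ObjectInspectorFactory.getStandardListObjectInspector(
      PrimitiveObjectInspectorFactory.javaStringObjectInspector)

  override def evaluate(args: Array[GenericUDF.DeferredObject]): AnyRef =
    java.util.Arrays.asList("xxx", "yyy", "zzz")

  override def getDisplayString(children: Array[String]): String =
    s"to_list_string(${children.mkString(", ")})"
}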
assign the result of this function to a variable and check that the message is correct.

Fixed. Does this address your comment?
ok to test

This looks great! One minor comment on the tests.

@marmbrus OK, thanks.
Test build #36628 has finished for PR 7248 at commit
Thanks! Merging to master.
Test build #36629 has finished for PR 7248 at commit
…ap<K,V> types used in Hive UDF

To help UDF developers understand the problem, throw an exception when unsupported Map<K,V> types are used in a Hive UDF. This fix is the same as #7248.

Author: Takeshi YAMAMURO <linguin.m.s@gmail.com>

Closes #7257 from maropu/ThrowExceptionWhenMapUsed and squashes the following commits:

916099a [Takeshi YAMAMURO] Fix style errors
7886dcc [Takeshi YAMAMURO] Throw an exception when Map<> used in Hive UDF
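For illustration, a hypothetical simple UDF that hits the same problem for maps (the class name and return value are assumptions):

// After JVM type erasure, reflection on evaluate() only sees the raw
// java.util.Map class, so the key/value types can't be recovered.
import org.apache.hadoop.hive.ql.exec.UDF

class UDFToMapString extends UDF {
  def evaluate(o: AnyRef): java.util.Map[String, String] =
    java.util.Collections.singletonMap("key", "value")
}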
The current implementation can't handle List<> as a return type in a Hive UDF and
throws a meaningless scala.MatchError.
Assume the UDF below:
import java.util.Arrays;
import java.util.List;
import org.apache.hadoop.hive.ql.exec.UDF;

public class UDFToListString extends UDF {
  public List<String> evaluate(Object o) {
    return Arrays.asList("xxx", "yyy", "zzz");
  }
}
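For illustration, a hypothetical way to register and call this UDF through Spark's Hive support (Spark 1.x API; the function name and the src test table are assumptions):

// Hypothetical repro; planning the query fails with scala.MatchError
// before this fix.
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.hive.HiveContext

object ReproduceMatchError {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(
      new SparkConf().setAppName("SPARK-6747-repro").setMaster("local[*]"))
    val hiveContext = new HiveContext(sc)
    hiveContext.sql("CREATE TEMPORARY FUNCTION to_list_string AS 'UDFToListString'")
    hiveContext.sql("SELECT to_list_string(key) FROM src").collect()
  }
}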
When this UDF is used, the following scala.MatchError is thrown:
scala.MatchError: interface java.util.List (of class java.lang.Class)
at org.apache.spark.sql.hive.HiveInspectors$class.javaClassToDataType(HiveInspectors.scala:174)
at org.apache.spark.sql.hive.HiveSimpleUdf.javaClassToDataType(hiveUdfs.scala:76)
at org.apache.spark.sql.hive.HiveSimpleUdf.dataType$lzycompute(hiveUdfs.scala:106)
at org.apache.spark.sql.hive.HiveSimpleUdf.dataType(hiveUdfs.scala:106)
at org.apache.spark.sql.catalyst.expressions.Alias.toAttribute(namedExpressions.scala:131)
at org.apache.spark.sql.catalyst.planning.PhysicalOperation$$anonfun$collectAliases$1.applyOrElse(patterns.scala:95)
at org.apache.spark.sql.catalyst.planning.PhysicalOperation$$anonfun$collectAliases$1.applyOrElse(patterns.scala:94)
at scala.runtime.AbstractPartialFunction.apply(AbstractPartialFunction.scala:33)
at scala.collection.TraversableLike$$anonfun$collect$1.apply(TraversableLike.scala:278)
...
To help UDF developers understand what went wrong, we need to throw a more suitable exception: an AnalysisException. A sketch of the added check follows.
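For reference, a minimal self-contained sketch of the kind of check this PR adds. In the actual patch the case lives in HiveInspectors.javaClassToDataType and throws AnalysisException; the object and method names here are illustrative, and a stand-in exception is used so the sketch compiles outside Spark's sql package.

// Sketch only: the real patch pattern-matches on the UDF's reflected
// return type inside HiveInspectors.javaClassToDataType.
object HiveUdfTypeCheck {
  def rejectErasedListType(clz: Class[_]): Unit = {
    if (classOf[java.util.List[_]].isAssignableFrom(clz)) {
      // AnalysisException in the real patch; a stand-in is thrown here
      // so the sketch does not depend on Spark-internal constructors.
      throw new UnsupportedOperationException(
        "List type in java is unsupported because JVM type erasure hides " +
        "the element type of List<>; implement GenericUDF instead of UDF")
    }
  }
}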