[SPARK-33463][SQL] Keep Job Id during incremental collect in Spark Thrift Server #30390
Conversation
Can one of the admins verify this patch?
@@ -126,6 +126,17 @@ private[hive] class SparkExecuteStatementOperation(
  }

  def getNextRowSet(order: FetchOrientation, maxRowsL: Long): RowSet = withLocalProperties {
    try {
      sqlContext.sparkContext.setJobGroup(statementId, statement)
minor: in execute, substitutorStatement is used as the description of the job group. Maybe pull substitutorStatement out into a class field and use it here as well?
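A minimal sketch of what that refactor could look like (not the merged code): names follow the snippet above, fetchNextRowSet is a hypothetical helper standing in for the original method body, and the no-arg VariableSubstitution constructor is an assumption.

import org.apache.spark.sql.internal.VariableSubstitution

// Pulled out as a field so execute() and getNextRowSet() share the same
// job-group description (sketch only).
private lazy val substitutorStatement: String =
  new VariableSubstitution().substitute(statement)

def getNextRowSet(order: FetchOrientation, maxRowsL: Long): RowSet = withLocalProperties {
  try {
    // Re-set the job group on every fetch so incremental-collect jobs
    // stay traceable to the original statement.
    sqlContext.sparkContext.setJobGroup(statementId, substitutorStatement)
    fetchNextRowSet(order, maxRowsL) // hypothetical: the original fetch logic
  } finally {
    sqlContext.sparkContext.clearJobGroup()
  }
}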
You are right! Thanks a lot for taking the time to review my code.
I implemented the change; hopefully it looks better now. Looking forward to hearing your thoughts.
LGTM
cc @wangyum
+1, LGTM. Thank you, @gumartinm and @juliuszsompolski.
Merged to master for Apache Spark 3.1.
@gumartinm, I added you to the Apache Spark contributor group and assigned SPARK-33463 to you.
What changes were proposed in this pull request?
When spark.sql.thriftServer.incrementalCollect is enabled, Job Ids get lost between incremental fetches, so tracing queries in Spark Thrift Server ends up being too complicated. This PR sets the job group again in getNextRowSet, so the statement's Job Id is kept across fetches.
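For context, a hedged sketch of where the per-fetch jobs come from (the table name is hypothetical): with incremental collect, results are drained through toLocalIterator, which launches a separate Spark job per partition as batches are pulled, and those jobs carry no statement id unless the job group is re-set before each fetch.

import org.apache.spark.sql.SparkSession

// Illustration only: mirrors what the Thrift server does when
// spark.sql.thriftServer.incrementalCollect is enabled.
val spark = SparkSession.builder().appName("incremental-collect-demo").getOrCreate()
val it = spark.sql("SELECT * FROM big_table").toLocalIterator() // big_table is hypothetical
// Each partition pulled through the iterator triggers a new Spark job;
// without setJobGroup, those jobs are not tagged with the statement's id.
while (it.hasNext) {
  it.next()
}
spark.stop()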
Why are the changes needed?
It makes tracing Spark Thrift Server queries easier.
Does this PR introduce any user-facing change?
No
How was this patch tested?
The existing tests are sufficient; no new tests are needed.