[SPARK-20897][SQL] cached self-join should not fail #18121


Closed · cloud-fan wants to merge 1 commit into master from cloud-fan/bug

Conversation

cloud-fan (Contributor)

What changes were proposed in this pull request?

In the failing test case, we have a `SortMergeJoinExec` for a self-join, which means there is a `ReusedExchange` node in the query plan. The query works fine without caching, but throws an exception in `SortMergeJoinExec.outputPartitioning` once we cache it.

The root cause is that `ReusedExchange` doesn't propagate the output partitioning from its child, so in `SortMergeJoinExec.outputPartitioning` we build a `PartitioningCollection` from a hash partitioning and an unknown partitioning, which fails.

This bug is mostly harmless, because inserting the `ReusedExchange` is the last step in preparing the physical plan, and we never call `SortMergeJoinExec.outputPartitioning` again after that.

However, if the DataFrame is cached, its physical plan becomes an `InMemoryTableScanExec`, which contains another physical plan representing the cached query. That inner plan has gone through the entire planning phase and may contain a `ReusedExchange`. The planner then calls `InMemoryTableScanExec.outputPartitioning`, which in turn calls `SortMergeJoinExec.outputPartitioning` and triggers this bug.
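The failure mode described above can be modeled with a small, self-contained sketch. These are simplified stand-ins, not Spark's actual classes, but they mirror the relevant invariant: Spark's `PartitioningCollection` requires all of its members to report the same number of partitions, which an `UnknownPartitioning(0)` from the buggy `ReusedExchange` violates.

```scala
// Simplified stand-ins for Spark's partitioning classes, to illustrate the
// failure described above. Not Spark's actual API.
sealed trait Partitioning { def numPartitions: Int }
case class HashPartitioning(exprs: Seq[String], numPartitions: Int) extends Partitioning
case class UnknownPartitioning(numPartitions: Int) extends Partitioning

// Like Spark's PartitioningCollection, require every member to agree on the
// number of partitions.
case class PartitioningCollection(partitionings: Seq[Partitioning]) extends Partitioning {
  require(partitionings.map(_.numPartitions).distinct.size == 1,
    "All partitionings must report the same number of partitions")
  def numPartitions: Int = partitionings.head.numPartitions
}

object Repro {
  def main(args: Array[String]): Unit = {
    // Both sides of the self-join report a real hash partitioning: fine.
    val ok = PartitioningCollection(
      Seq(HashPartitioning(Seq("i"), 200), HashPartitioning(Seq("i"), 200)))
    println(ok.numPartitions)

    // The buggy ReusedExchange reported UnknownPartitioning(0) instead of its
    // child's hash partitioning, so building the collection failed.
    val failed =
      try {
        PartitioningCollection(
          Seq(HashPartitioning(Seq("i"), 200), UnknownPartitioning(0)))
        false
      } catch { case _: IllegalArgumentException => true }
    println(s"mixing unknown partitioning failed: $failed")
  }
}
```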

How was this patch tested?

A new regression test.

@cloud-fan (Contributor, Author)

cc @gatorsmile @davies

@@ -58,6 +59,10 @@ case class ReusedExchangeExec(override val output: Seq[Attribute], child: Exchan
override protected[sql] def doExecuteBroadcast[T](): broadcast.Broadcast[T] = {
child.executeBroadcast()
}

override def outputPartitioning: Partitioning = child.outputPartitioning
Member

ReusedExchangeExec can have distinct sets of output attribute ids. Shall we also update `outputPartitioning` and `outputOrdering` if its output differs from `child.output`?

Contributor (Author)

good catch!

@viirya (Member)

viirya commented May 26, 2017

LGTM except one comment.

@SparkQA

SparkQA commented May 26, 2017

Test build #77426 has finished for PR 18121 at commit ca4a3d1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented May 27, 2017

Test build #77445 has finished for PR 18121 at commit e91311c.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile (Member) left a comment

LGTM pending Jenkins

@SparkQA

SparkQA commented May 27, 2017

Test build #77454 has finished for PR 18121 at commit 460f072.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile (Member)

Thanks! Merging to master/2.2.

asfgit pushed a commit that referenced this pull request May 27, 2017

Author: Wenchen Fan <wenchen@databricks.com>

Closes #18121 from cloud-fan/bug.

(cherry picked from commit 08ede46)
Signed-off-by: Xiao Li <gatorsmile@gmail.com>
@asfgit closed this in 08ede46 on May 27, 2017
case h: HashPartitioning => h.copy(expressions = h.expressions.map(updateAttr))
case other => other
}
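The attribute remapping in the snippet above can be illustrated with a self-contained sketch. These are simplified stand-in types, not Spark's actual classes; the name `updateAttr` comes from the patch, but the signatures here are illustrative. The idea is to pair each child output attribute with the reused node's attribute at the same position, then rewrite the partitioning expressions through that mapping.

```scala
// Simplified model of how ReusedExchangeExec can rewrite its child's
// HashPartitioning so that it refers to the reused node's own attribute ids.
// Stand-in types, not Spark's actual classes.
case class Attribute(name: String, id: Long)
case class HashPartitioning(expressions: Seq[Attribute], numPartitions: Int)

def remap(part: HashPartitioning,
          childOutput: Seq[Attribute],
          reusedOutput: Seq[Attribute]): HashPartitioning = {
  // Pair each child output attribute id with the reused node's attribute at
  // the same position, then rewrite the partitioning expressions through it.
  val byId = childOutput.map(_.id).zip(reusedOutput).toMap
  def updateAttr(a: Attribute): Attribute = byId.getOrElse(a.id, a)
  part.copy(expressions = part.expressions.map(updateAttr))
}

object Demo {
  def main(args: Array[String]): Unit = {
    val childOut  = Seq(Attribute("i", 1), Attribute("j", 2))
    val reusedOut = Seq(Attribute("i", 10), Attribute("j", 20))
    val p = remap(HashPartitioning(Seq(Attribute("i", 1)), 200), childOut, reusedOut)
    println(p) // HashPartitioning(List(Attribute(i,10)),200)
  }
}
```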

Contributor

@cloud-fan @viirya Could you help explain why we only consider HashPartitioning here?
How about RangePartitioning?
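For context on the question: Spark's `RangePartitioning` also embeds attribute references, through the expressions inside its `SortOrder`s, so the same remapping concern applies to it in principle. A sketch with simplified stand-in types (illustrative only, not Spark's API, and not an answer to whether the patch needed to handle it):

```scala
// Stand-in types showing that a range partitioning also embeds attribute
// references (inside its sort orders), just like a hash partitioning does.
case class Attribute(name: String, id: Long)
case class SortOrder(child: Attribute, ascending: Boolean)
case class RangePartitioning(ordering: Seq[SortOrder], numPartitions: Int)

// The same positional id-remapping used for HashPartitioning would apply here.
def remapRange(part: RangePartitioning,
               childOutput: Seq[Attribute],
               reusedOutput: Seq[Attribute]): RangePartitioning = {
  val byId = childOutput.map(_.id).zip(reusedOutput).toMap
  part.copy(ordering = part.ordering.map { o =>
    o.copy(child = byId.getOrElse(o.child.id, o.child))
  })
}

object RangeDemo {
  def main(args: Array[String]): Unit = {
    val childOut  = Seq(Attribute("i", 1))
    val reusedOut = Seq(Attribute("i", 10))
    val r = remapRange(
      RangePartitioning(Seq(SortOrder(Attribute("i", 1), ascending = true)), 8),
      childOut, reusedOut)
    println(r.ordering.head.child.id) // 10
  }
}
```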
