
Conversation

@c21 c21 (Contributor) commented Aug 28, 2020

What changes were proposed in this pull request?

This is a followup to #29342 that does two things:

  • Per [SPARK-32399][SQL] Full outer shuffled hash join #29342 (comment), switch from Java HashSet to Spark's in-house OpenHashSet to track matched rows for non-unique join keys (see the sketch below). I checked the OpenHashSet implementation: it is built from a key index (OpenHashSet._bitset as BitSet) and a key array (OpenHashSet._data as Array). Java HashSet is built on top of HashMap, which stores values in Node linked lists and in theory should take more memory than OpenHashSet. I reran the same benchmark query used in [SPARK-32399][SQL] Full outer shuffled hash join #29342 and verified that the query performs similarly with HashSet and OpenHashSet.
  • Track the memory usage of the extra BitSet/OpenHashSet data structure in the full outer SHJ metrics. This depends on the first change, because there is no easy way to get the memory size of a Java HashSet.
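For illustration, here is a minimal sketch of the data-structure swap described above, assuming the matched build-side rows are identified by a (key index, value index) pair packed into a `Long` (the exact encoding used in the PR may differ):

```scala
import org.apache.spark.util.collection.OpenHashSet

// Spark's OpenHashSet is backed by a flat array (_data) plus a BitSet (_bitset),
// so unlike java.util.HashSet there are no per-entry HashMap.Node objects to allocate.
val matchedRows = new OpenHashSet[Long](64)

// Hypothetical packing of (keyIndex, valueIndex) into a single Long for non-unique keys.
def pack(keyIndex: Int, valueIndex: Int): Long =
  (keyIndex.toLong << 32) | (valueIndex & 0xFFFFFFFFL)

// While streaming the other side, mark build-side rows that found a match ...
matchedRows.add(pack(3, 7))
// ... and afterwards, emit only the build-side rows that were never marked.
val wasMatched = matchedRows.contains(pack(3, 7))
```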

Why are the changes needed?

To surface the memory usage of full outer SHJ more accurately.
This can help users and developers debug and improve full outer SHJ.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Added a unit test in SQLMetricsSuite.scala.

@c21 c21 (Contributor, Author) commented Aug 28, 2020

cc @maropu and @cloud-fan if you guys have time to take a look, thanks.

@c21 c21 changed the title [SPARK-32629] Track metrics of BitSet/OpenHashSet in full outer SHJ [SPARK-32629][SQL][FOLLOWUP] Track metrics of BitSet/OpenHashSet in full outer SHJ Aug 28, 2020
* @param plan `SparkPlan` operator to check metrics
* @param expectedMetrics the expected metrics. The format is `metric name -> metric value`.
*/
protected def testMetricsInSparkPlanOperator(
maropu (Member):

We need to put this func here instead of SQLMetricsSuite?

c21 (Contributor, Author):

@maropu - I am following the convention of the other methods here, e.g. testSparkPlanMetrics.
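For reference, a hypothetical call site for this helper, assuming it takes the operator's `SparkPlan` and a map of metric name -> expected value as the scaladoc above describes (`df` and the expected values below are illustrative only):

```scala
import org.apache.spark.sql.execution.joins.ShuffledHashJoinExec

// Run the query first so the SQL metrics are populated.
df.collect()

// Grab the operator to check directly from the executed plan,
// then verify its metrics by name.
val shj = df.queryExecution.executedPlan.collectFirst {
  case j: ShuffledHashJoinExec => j
}.get
testMetricsInSparkPlanOperator(shj, Map("numOutputRows" -> 10L))
```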

// in full outer shuffled hash join
val matchedKeys = new BitSet(hashedRelation.maxNumKeysIndex)
val buildDataSize = longMetric("buildDataSize")
buildDataSize += matchedKeys.capacity / 8
maropu (Member):

nit: longMetric("buildDataSize") += matchedKeys.capacity / 8?

c21 (Contributor, Author):

@maropu - sure, updated.
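For context, a hedged sketch of what the metric bookkeeping could look like after applying the suggestion; the non-unique-key estimate below (8 bytes per `Long` slot plus roughly one bit per slot) is my assumption, not necessarily the exact expression merged in the PR:

```scala
import org.apache.spark.util.collection.{BitSet, OpenHashSet}

// Unique join keys: one bit per key index in the build-side relation.
def bitSetSizeInBytes(matchedKeys: BitSet): Long =
  matchedKeys.capacity / 8L // BitSet.capacity is in bits

// Non-unique join keys (assumption): flat Long array (8 bytes/slot) plus ~1 bit/slot
// for the key-index bit set inside OpenHashSet.
def openHashSetSizeInBytes(matchedRows: OpenHashSet[Long]): Long =
  matchedRows.capacity * 8L + matchedRows.capacity / 8

// In the operator, either estimate would then be added in one line, e.g.:
//   longMetric("buildDataSize") += bitSetSizeInBytes(matchedKeys)
```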

@maropu maropu (Member) commented Aug 28, 2020

Looks okay if the tests pass. Also cc: @agrawaldevesh

@maropu maropu changed the title [SPARK-32629][SQL][FOLLOWUP] Track metrics of BitSet/OpenHashSet in full outer SHJ [SPARK-32629][SQL] Track metrics of BitSet/OpenHashSet in full outer SHJ Aug 28, 2020
@maropu maropu (Member) commented Aug 28, 2020

Basically, we don't need the [FOLLOWUP] tag for a new ticket, I think.

@c21 c21 (Contributor, Author) commented Aug 28, 2020

> Basically, we don't need the [FOLLOWUP] tag for a new ticket, I think.

@maropu - cool, thanks.

@agrawaldevesh commented:

Wow, cool. So any improvement in either memory usage, GC, or CPU time by switching to open hashset?

* 1. Process rows from stream side by looking up hash relation.
* Mark the matched rows from build side be looked up.
* A `BitSet` is used to track matched rows with key index.
* A [[BitSet]] is used to track matched rows with key index.
cloud-fan (Contributor):

nit: just say A bit set is ... to be generic.

c21 (Contributor, Author):

@cloud-fan - sure, updated.


@SparkQA SparkQA commented Aug 28, 2020

Test build #127975 has finished for PR 29566 at commit 4a35fa6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@c21 c21 (Contributor, Author) commented Aug 28, 2020

> So any improvement in either memory usage, GC, or CPU time by switching to open hashset?

@agrawaldevesh - here is a report for one internal production query that does a FULL OUTER JOIN between one large table and one small table. I can't share the exact query text and data, but the query shape is:

INSERT OVERWRITE TABLE output_table
PARTITION (...)
SELECT ...
FROM large_table a
FULL OUTER JOIN small_table b
ON a.col_x = b.col_y

input metrics:
large_table (ORC format): uncompressed data input size: 54TB
small_table (ORC format): uncompressed data input size: 85GB

execution metrics:

Number of tasks per stage:
stage 0: 40547 (read large table)
stage 1: 158 (read small table)
stage 2: 5063 (SHJ and insert to output table)

Total shuffle bytes across executors: 15.1TB.

| Query type | Aggregated executors CPU time (ms) | Aggregated executors GC time (ms) |
|---|---|---|
| use java HashSet | 3.48 B | 124.0 M |
| use spark OpenHashSet | 3.22 B | 91.2 M |

TL;DR: by switching to OpenHashSet, we see roughly a 7% reduction in CPU time and a 27% reduction in GC time.
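For reference, those percentages follow from the table above: (3.48 − 3.22) / 3.48 ≈ 7.5% less CPU time and (124.0 − 91.2) / 124.0 ≈ 26.5% less GC time.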

@c21 c21 (Contributor, Author) commented Aug 28, 2020

Addressed all comments and the PR is ready for review again, thanks.

@SparkQA SparkQA commented Aug 28, 2020

Test build #127992 has finished for PR 29566 at commit b932caa.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@maropu maropu closed this in cfe012a Aug 29, 2020
@maropu maropu (Member) commented Aug 29, 2020

Thanks! Merged to master.

@c21 c21 (Contributor, Author) commented Aug 29, 2020

Thanks @maropu , @cloud-fan and @agrawaldevesh for discussion and review!

@c21 c21 deleted the add-metrics branch August 29, 2020 22:29