[SNAP-2366] row buffer fault-in, forced rollover, merge small batches #1046
base: master
Conversation
- add check for the two cases in table stats service:
  - a "large enough" row buffer (currently "large enough" is anything more than maxDeltaRows/8) that has not seen any updates/deletes since the last check; in this case schedule a task to force rollover the row buffer in column table (a rough sketch of this check follows these notes)
  - a bucket of column table having multiple small batches (non-transactional check); if so then submit a task to merge those after checking for transactional snapshot; merge is done by a locally created ColumnTableScan->ColumnInsertExec plan where the scan uses an iterator only on the small batches
- added a ColumnFormatStatsIterator that can take a bunch of stats rows and create an iterator over just those (as required for batch merge)
- added new scan metrics for disk reads: a) disk rows from row buffer, b) partial column batches on disk, c) full column batches on disk
- extended SQLMetrics types with a new SPLIT_SUM_METRIC that allows displaying multiple metrics against a common name; ColumnTableScan now uses this to combine some metrics, else the display becomes too large (especially with the newly added disk read metrics)
- use hive-metadata (ExternalTableMetaData) to get the number of rows instead of reading it from the row buffer table (which is subject to change in future)
- fixed disk metrics collection added previously; set the metric correctly for both the row buffer iterator (ResultSetTraversal) and ColumnFormatIterator
- added a metric for remote batch fetch
- fixed multiple ColumnTableScans causing split metrics to add up into one ColumnTableScan; now a unique ID is used for the split metrics of each ColumnTableScan instance
- fixed an NPE in SnappyTableStatsProviderService while filling up the result map from members, since CHM cannot hold null values
- use a common entry map in ColumnFormatIterator disk iteration instead of creating a separate one for every column batch
- added implementation of PURGE_CODEGEN_CACHES as StoreCallbacksImpl.clearCodegenCaches
- limited to one task per table for the background rolloverRowBuffer and mergeSmallBatches tasks
- replaced a few usages of Map.put with justPut for koloboke maps
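A rough sketch of the rollover check described in the first item above. BufferStats, maybeScheduleRollover and scheduleRolloverTask are illustrative names, not the actual table stats service code:

```scala
// Hypothetical sketch of the forced-rollover check; names are illustrative.
case class BufferStats(numRows: Long, changedSinceLastCheck: Boolean)

def maybeScheduleRollover(table: String, stats: BufferStats, maxDeltaRows: Long,
    scheduleRolloverTask: String => Unit): Unit = {
  // "large enough" = more than maxDeltaRows/8 rows, and no updates/deletes
  // observed since the last stats check
  if (stats.numRows > maxDeltaRows / 8 && !stats.changedSinceLastCheck) {
    scheduleRolloverTask(table)
  }
}
```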
First set of comments. Can see more after one of the points about updated & deleted columns is clarified.
private def withExceptionHandling(f: => Unit, doFinally: () => Unit = null): Unit = {
This method can be moved to Utils class.
Will do
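For reference, a helper with the signature quoted above presumably just wraps the background task in try/catch/finally. This is a minimal sketch under that assumption (logging and cleanup behavior are guesses, not the actual implementation):

```scala
import org.slf4j.LoggerFactory

object BackgroundTaskUtils {
  private val log = LoggerFactory.getLogger(getClass)

  // Run the task, log (rather than propagate) any failure, always run the cleanup.
  def withExceptionHandling(f: => Unit, doFinally: () => Unit = null): Unit = {
    try {
      f
    } catch {
      case t: Throwable => log.error("background task failed", t)
    } finally {
      if (doFinally ne null) doFinally()
    }
  }
}
```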
logInfo(s"Found small batches in ${pr.getName}: ${smallBatches.map(_.getId)}")
val cache = pr.getGemFireCache
implicit val executionContext = Utils.executionContext(cache)
Future(withExceptionHandling({
There may be a situation where many such future tasks pile up for the same BucketRegion if the future thread does not get a chance to complete before the next stats publish. It would be better to mark the BucketRegion and exclude in-progress buckets from being picked.
That cannot happen because this deliberately uses a "mergeTasks" map with "computeIfAbsent", which will add only one future per PR at a time. The mergeTasks entry for that PR is cleared only at the end of the Future's execution. Same for rolloverTasks.
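A minimal sketch of that guard, assuming a ConcurrentHashMap keyed by table/PR name; MergeTaskGuard and submitMergeTask are illustrative names, only the computeIfAbsent pattern mirrors the actual mergeTasks/rolloverTasks handling:

```scala
import java.util.concurrent.ConcurrentHashMap
import java.util.function.{Function => JFunction}
import scala.concurrent.{ExecutionContext, Future}

object MergeTaskGuard {
  private val mergeTasks = new ConcurrentHashMap[String, Future[Unit]]()

  def submitMergeTask(prName: String, task: () => Unit)
      (implicit ec: ExecutionContext): Unit = {
    // computeIfAbsent ensures at most one in-flight future per table; a second
    // stats publish for the same table while a merge is running becomes a no-op.
    mergeTasks.computeIfAbsent(prName, new JFunction[String, Future[Unit]] {
      override def apply(key: String): Future[Unit] = Future {
        try task() finally mergeTasks.remove(key) // allow a new task only after completion
      }
    })
  }
}
```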
partitionColumnAliases = Nil, baseRelation = null, schema, allFilters = Nil,
  schemaAttrs, caseSensitive = true)
// reduce min delta row size to avoid going through rolloverRowBuffers again
val insertPlan = ColumnInsertExec(tableScan, Nil, Nil,
Can you please explain here, maybe in the comment section, how this will handle updated columns and deleted columns?
The ColumnBatchIterator is passed the stats row entry. All the remaining columns, including delta/delete, are looked up by the iterator (see ColumnFormatStatsIterator.getColumnValue) when the generated code asks for them.
So this will be the same as iterating batches in ColumnTableScan, which returns merged entries with deltas/deletes applied. The ColumnInsert is tied to the output of this, hence it will create a combined, merged batch.
Note that merging of deltas into the main batch (when they become large enough) or deletes into the main batch (when a large number of entries are deleted) will be handled separately. That does not depend on the main batch being small but rather on the deltas being large. That merge needs to be done in the operation thread itself (the one that created the last delta causing it to grow large). Right now only the case where all entries are deleted is handled.
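To make the flow concrete, here is a purely illustrative model of that merge path: the scan side yields the rows of the small batches with deltas applied and deleted rows skipped, and the insert side packs them into one combined batch. Row, ColumnBatch and applyDeltasAndDeletes are hypothetical stand-ins for the real ColumnTableScan / ColumnInsertExec / ColumnFormatStatsIterator machinery:

```scala
case class Row(values: Seq[Any])
case class ColumnBatch(rows: Seq[Row])

def mergeSmallBatches(smallBatches: Seq[ColumnBatch],
    applyDeltasAndDeletes: ColumnBatch => Seq[Row]): ColumnBatch = {
  // scan side: merged entries per small batch (deltas applied, deletes skipped)
  val mergedRows = smallBatches.flatMap(applyDeltasAndDeletes)
  // insert side: one larger batch replaces the many small ones
  ColumnBatch(mergedRows)
}
```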
added an OperationContext in SnappySession that can be used to persist context across multiple plan executions (e.g. caching for putInto, then actual execution)
Changes proposed in this pull request
- add check for the two cases in table stats service:
  - a "large enough" row buffer (currently "large enough" is anything more than maxDeltaRows/8) that has not seen any updates/deletes since the last check; in this case schedule a task to force rollover the row buffer in column table
  - a bucket of column table having multiple small batches (non-transactional check); if so then submit a task to merge those after checking for transactional snapshot; merge is done by a locally created ColumnTableScan->ColumnInsertExec plan where the scan uses an iterator only on the small batches
- added a ColumnFormatStatsIterator that can take a bunch of stats rows and create an iterator over just those (as required for batch merge)
- added new scan metrics for disk reads: a) disk rows from row buffer, b) partial column batches on disk, c) full column batches on disk
- extended SQLMetrics types with a new SPLIT_SUM_METRIC that allows displaying multiple metrics against a common name; ColumnTableScan now uses this to combine some metrics, else the display becomes too large (especially with the newly added disk read metrics)
- use hive-metadata (ExternalTableMetaData) to get the number of rows instead of reading it from the row buffer table (which is subject to change in future)
- fixed an NPE in SnappyTableStatsProviderService while filling up the result map from members, since CHM cannot hold null values (a short illustration follows this list)
- use a common entry map in ColumnFormatIterator disk iteration instead of creating a separate one for every column batch
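A short illustration of the CHM point from the NPE item above; this is standard java.util.concurrent behavior, not SnappyData-specific code:

```scala
import java.util.concurrent.ConcurrentHashMap

object ChmNullValues {
  def main(args: Array[String]): Unit = {
    val resultMap = new ConcurrentHashMap[String, AnyRef]()
    val statsFromMember: AnyRef = null // e.g. a member that reported nothing
    // Unlike HashMap, ConcurrentHashMap rejects null keys and values, so a
    // member with no stats must be skipped (or mapped to a sentinel).
    if (statsFromMember ne null) {
      resultMap.put("member-1", statsFromMember)
    }
    // resultMap.put("member-1", null) would throw NullPointerException
  }
}
```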
Patch testing
precheckin; new unit tests to be added next
ReleaseNotes.txt changes
NA
Other PRs
TIBCOSoftware/snappy-store#391