[CARBONDATA-861] Improvements in query #709

ravipesala · 2017-03-29T19:00:51Z

Following are the list of improvements done in this part of PR.

Removed multiple creation of array and copy of it in Dimension and measure chunk readers.
Simplified logic of finding offsets of nodictionary keys in the class SafeVariableLengthDimensionDataChunkStore.
Avoided byte array creation and copy for nodictionary columns in case of vectorized reader. Instead directly sending the length and offset to vector.
Removed unnecessary decoder plan additions to oprtimized plan. It can optimize the codegen flow.
Updated CompareTest to take table blocksize and kept as 32 Mb in order to make use of small sorting when doing take ordered in spark.

asfbot · 2017-03-29T19:00:53Z

Can one of the admins verify this patch?

asfbot · 2017-03-29T19:00:53Z

Can one of the admins verify this patch?

asfbot · 2017-03-29T19:00:53Z

Can one of the admins verify this patch?

CarbonDataQA · 2017-03-29T19:09:52Z

Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/1380/

jackylk · 2017-04-05T16:09:07Z

examples/spark2/src/main/scala/org/apache/carbondata/examples/CompareTest.scala

@@ -306,6 +307,8 @@ object CompareTest {
    // do GC and sleep for some time before running next table
    System.gc()
    Thread.sleep(1000)
+    System.gc()


Is this required?

There is no guarntee that GC will be called after calling of System.gc(), thats why after waiting for 1 second called again to increase the probability of running GC

jackylk · 2017-04-05T16:11:09Z

integration/spark2/src/main/scala/org/apache/spark/sql/CarbonDataFrameWriter.scala

      } else {
        ""
      }
    )
+    if (property.nonEmpty && property.charAt(property.length-1) == ',') {
+      property = property.replace(property.length-1, property.length, "")


change property.length-1 to property.length - 1

CarbonDataQA · 2017-04-06T05:47:32Z

Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/1461/

jackylk · 2017-04-11T01:05:21Z

@ravipesala please rebase

CarbonDataQA · 2017-04-11T05:46:23Z

Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/1560/

jackylk · 2017-04-11T13:01:20Z

LGTM

ravipesala force-pushed the minor-perf-improv branch 2 times, most recently from 12af0f8 to f8607a3 Compare March 30, 2017 08:15

ravipesala changed the title ~~[WIP] Improvements in query~~ [CARBONDATA-861] Improvements in query Apr 5, 2017

jackylk reviewed Apr 5, 2017

View reviewed changes

ravipesala added 7 commits April 11, 2017 10:49

Removed unnecessary array copy and bitset checking

73e515c

OPtimized code

5ddf307

Added table_blocksize option.

eecc8f9

Removed unnecessary plan from optimized plan.

0b5164f

Fixed test

30f992f

FIxed comment

ad7a329

Rebased

579b50f

ravipesala force-pushed the minor-perf-improv branch from b593705 to 579b50f Compare April 11, 2017 05:37

asfgit closed this in 4cdb7a2 Apr 11, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CARBONDATA-861] Improvements in query #709

[CARBONDATA-861] Improvements in query #709

ravipesala commented Mar 29, 2017

asfbot commented Mar 29, 2017

asfbot commented Mar 29, 2017

asfbot commented Mar 29, 2017

CarbonDataQA commented Mar 29, 2017

jackylk Apr 5, 2017

ravipesala Apr 6, 2017

jackylk Apr 5, 2017

ravipesala Apr 6, 2017

CarbonDataQA commented Apr 6, 2017

jackylk commented Apr 11, 2017

CarbonDataQA commented Apr 11, 2017

jackylk commented Apr 11, 2017

[CARBONDATA-861] Improvements in query #709

[CARBONDATA-861] Improvements in query #709

Conversation

ravipesala commented Mar 29, 2017

asfbot commented Mar 29, 2017

asfbot commented Mar 29, 2017

asfbot commented Mar 29, 2017

CarbonDataQA commented Mar 29, 2017

jackylk Apr 5, 2017

Choose a reason for hiding this comment

ravipesala Apr 6, 2017

Choose a reason for hiding this comment

jackylk Apr 5, 2017

Choose a reason for hiding this comment

ravipesala Apr 6, 2017

Choose a reason for hiding this comment

CarbonDataQA commented Apr 6, 2017

jackylk commented Apr 11, 2017

CarbonDataQA commented Apr 11, 2017

jackylk commented Apr 11, 2017