[SPARK-2883][SQL] Orc support through datasource api #3753

scwf · 2014-12-21T07:23:32Z

Adds support for reading and writing ORC files through the new data source API.

SparkQA · 2014-12-21T07:27:24Z

Test build #24678 has started for PR 3753 at commit a99a106.

This patch merges cleanly.

SparkQA · 2014-12-21T07:30:43Z

Test build #24678 has finished for PR 3753 at commit a99a106.

This patch fails to build.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class DefaultSource extends RelationProvider
- case class OrcRelation(path: String)(@transient val sqlContext: SQLContext)

AmplabJenkins · 2014-12-21T07:30:44Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24678/
Test FAILed.

scwf · 2014-12-21T07:37:23Z

It seems there is no OrcNewInputFormat in Hive 0.12, which causes the compile failure when building against Hive 0.12.

SparkQA · 2014-12-21T08:17:29Z

Test build #24681 has started for PR 3753 at commit 4b4e66b.

This patch merges cleanly.

SparkQA · 2014-12-21T09:28:16Z

Test build #24681 has finished for PR 3753 at commit 4b4e66b.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class DefaultSource extends RelationProvider
- case class OrcRelation(path: String)(@transient val sqlContext: SQLContext)
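
The two public classes listed above follow the usual provider/relation split: a provider builds a relation from user options, and the relation carries its context in a second parameter list. The sketch below only illustrates that shape with stand-in types; `FakeContext`, `Relation`, `Provider`, `OrcLikeRelation`, and `DefaultSourceSketch` are hypothetical names, not Spark's classes. Its main point is why the context can sit in a second parameter list: a Scala case class derives `equals`, `hashCode`, and `toString` from the first list only.

```scala
// Stand-in for SQLContext; hypothetical, for illustration only.
class FakeContext(val name: String)

// Stand-in for a base relation type.
trait Relation { def context: FakeContext }

// Stand-in for a provider: builds a relation from string options.
trait Provider {
  def createRelation(context: FakeContext, parameters: Map[String, String]): Relation
}

// Only `path` takes part in equals/hashCode/toString;
// the context in the second parameter list is ignored by them.
case class OrcLikeRelation(path: String)(@transient val context: FakeContext)
  extends Relation

class DefaultSourceSketch extends Provider {
  override def createRelation(
      context: FakeContext,
      parameters: Map[String, String]): Relation = {
    // Fail early when the required option is missing.
    val path = parameters.getOrElse("path", sys.error("Option 'path' must be specified"))
    OrcLikeRelation(path)(context)
  }
}

object ProviderDemo {
  def main(args: Array[String]): Unit = {
    val provider = new DefaultSourceSketch
    val a = provider.createRelation(new FakeContext("ctx-a"), Map("path" -> "/tmp/data.orc"))
    val b = provider.createRelation(new FakeContext("ctx-b"), Map("path" -> "/tmp/data.orc"))
    // Same path, different contexts: case-class equality compares only `path`.
    println(a == b)
  }
}
```

Keeping the context out of the first parameter list means two relations over the same path compare equal regardless of which context created them.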

AmplabJenkins · 2014-12-21T09:28:19Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24681/
Test PASSed.

liancheng · 2014-12-22T06:52:35Z

We are planning to add first-class support for partitioned tables in the external data source API in 1.3. An interface along the lines of PartitionedRelation will be provided to handle partitioning in a more general and customizable way. I'd suggest either supporting only single-file access in this PR or waiting for a while :)

scwf · 2014-12-23T15:22:57Z

Thanks @liancheng. When will this interface for partitioned tables be available, and is anyone working on it? The partitioned table support for ORC in this PR follows the Parquet implementation. I suggest keeping it here and letting this PR go in. Once the partitioned table interface is ready, I will open a PR to refactor this.
/cc @marmbrus

SparkQA · 2014-12-27T15:47:33Z

Test build #24846 has started for PR 3753 at commit 1d3dce3.

This patch merges cleanly.

SparkQA · 2014-12-27T16:59:11Z

Test build #24846 has finished for PR 3753 at commit 1d3dce3.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class DefaultSource extends RelationProvider
- case class OrcRelation(path: String)(@transient val sqlContext: SQLContext)

AmplabJenkins · 2014-12-27T16:59:14Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24846/
Test PASSed.

marmbrus · 2014-12-30T20:10:33Z

I'm working on it, and it should be part of 1.3. This PR is just adding a ton of duplicated code which is a maintenance burden so I'm hesitant to merge it in. I agree with @liancheng that we should wait.

scwf · 2014-12-30T22:59:42Z

Ok

SparkQA · 2015-02-14T02:42:54Z

Test build #27469 has started for PR 3753 at commit 9d7c082.

This patch merges cleanly.

SparkQA · 2015-02-14T02:57:31Z

Test build #27470 has started for PR 3753 at commit f21b693.

This patch merges cleanly.

SparkQA · 2015-02-14T04:10:03Z

Test build #27470 has finished for PR 3753 at commit f21b693.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class OrcHadoopWriter(@transient jobConf: JobConf) extends SparkHadoopWriter(jobConf)

AmplabJenkins · 2015-02-14T04:10:07Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27470/
Test FAILed.

SparkQA · 2015-02-14T04:30:28Z

Test build #27469 has finished for PR 3753 at commit 9d7c082.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class OrcHadoopWriter(@transient jobConf: JobConf) extends SparkHadoopWriter(jobConf)

AmplabJenkins · 2015-02-14T04:30:33Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27469/
Test PASSed.

scwf · 2015-02-15T02:21:23Z

@liancheng and @marmbrus, I removed the partitioned table support for ORC and added a write interface based on the newly introduced write API. Could you help review this? Thanks.

krzysztof-indyk · 2015-03-30T18:11:08Z

+1

scwf · 2015-04-21T02:22:14Z

Retest this please

SparkQA · 2015-04-21T02:28:35Z

Test build #30621 has started for PR 3753 at commit f21b693.

SparkQA · 2015-04-21T02:30:07Z

Test build #30621 has finished for PR 3753 at commit f21b693.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class OrcHadoopWriter(@transient jobConf: JobConf) extends SparkHadoopWriter(jobConf)
This patch does not change any dependencies.

SparkQA · 2015-04-21T04:58:30Z

Test build #30634 has started for PR 3753 at commit 956c095.

SparkQA · 2015-04-21T05:02:36Z

Test build #30634 has finished for PR 3753 at commit 956c095.

This patch fails to build.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class OrcHadoopWriter(@transient jobConf: JobConf) extends SparkHadoopWriter(jobConf)
This patch does not change any dependencies.

AmplabJenkins · 2015-04-21T05:02:37Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30634/
Test FAILed.

SparkQA · 2015-04-21T06:27:53Z

Test build #30645 has started for PR 3753 at commit 0dd36ee.

SparkQA · 2015-04-21T06:29:26Z

Test build #30645 has finished for PR 3753 at commit 0dd36ee.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class OrcHadoopWriter(@transient jobConf: JobConf) extends SparkHadoopWriter(jobConf)
This patch does not change any dependencies.

AmplabJenkins · 2015-04-21T06:29:27Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30645/
Test FAILed.

SparkQA · 2015-04-21T06:37:35Z

Test build #30648 has started for PR 3753 at commit 9788b85.

SparkQA · 2015-04-21T08:35:56Z

Test build #30648 has finished for PR 3753 at commit 9788b85.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.
This patch removes the following dependencies:
- RoaringBitmap-0.4.5.jar
- activation-1.1.jar
- akka-actor_2.10-2.3.4-spark.jar
- akka-remote_2.10-2.3.4-spark.jar
- akka-slf4j_2.10-2.3.4-spark.jar
- aopalliance-1.0.jar
- arpack_combined_all-0.1.jar
- avro-1.7.7.jar
- breeze-macros_2.10-0.11.2.jar
- breeze_2.10-0.11.2.jar
- chill-java-0.5.0.jar
- chill_2.10-0.5.0.jar
- commons-beanutils-1.7.0.jar
- commons-beanutils-core-1.8.0.jar
- commons-cli-1.2.jar
- commons-codec-1.10.jar
- commons-collections-3.2.1.jar
- commons-compress-1.4.1.jar
- commons-configuration-1.6.jar
- commons-digester-1.8.jar
- commons-httpclient-3.1.jar
- commons-io-2.1.jar
- commons-lang-2.5.jar
- commons-lang3-3.3.2.jar
- commons-math-2.1.jar
- commons-math3-3.4.1.jar
- commons-net-2.2.jar
- compress-lzf-1.0.0.jar
- config-1.2.1.jar
- core-1.1.2.jar
- curator-client-2.4.0.jar
- curator-framework-2.4.0.jar
- curator-recipes-2.4.0.jar
- gmbal-api-only-3.0.0-b023.jar
- grizzly-framework-2.1.2.jar
- grizzly-http-2.1.2.jar
- grizzly-http-server-2.1.2.jar
- grizzly-http-servlet-2.1.2.jar
- grizzly-rcm-2.1.2.jar
- groovy-all-2.3.7.jar
- guava-14.0.1.jar
- guice-3.0.jar
- hadoop-annotations-2.2.0.jar
- hadoop-auth-2.2.0.jar
- hadoop-client-2.2.0.jar
- hadoop-common-2.2.0.jar
- hadoop-hdfs-2.2.0.jar
- hadoop-mapreduce-client-app-2.2.0.jar
- hadoop-mapreduce-client-common-2.2.0.jar
- hadoop-mapreduce-client-core-2.2.0.jar
- hadoop-mapreduce-client-jobclient-2.2.0.jar
- hadoop-mapreduce-client-shuffle-2.2.0.jar
- hadoop-yarn-api-2.2.0.jar
- hadoop-yarn-client-2.2.0.jar
- hadoop-yarn-common-2.2.0.jar
- hadoop-yarn-server-common-2.2.0.jar
- ivy-2.4.0.jar
- jackson-annotations-2.4.0.jar
- jackson-core-2.4.4.jar
- jackson-core-asl-1.8.8.jar
- jackson-databind-2.4.4.jar
- jackson-jaxrs-1.8.8.jar
- jackson-mapper-asl-1.8.8.jar
- jackson-module-scala_2.10-2.4.4.jar
- jackson-xc-1.8.8.jar
- jansi-1.4.jar
- javax.inject-1.jar
- javax.servlet-3.0.0.v201112011016.jar
- javax.servlet-3.1.jar
- javax.servlet-api-3.0.1.jar
- jaxb-api-2.2.2.jar
- jaxb-impl-2.2.3-1.jar
- jcl-over-slf4j-1.7.10.jar
- jersey-client-1.9.jar
- jersey-core-1.9.jar
- jersey-grizzly2-1.9.jar
- jersey-guice-1.9.jar
- jersey-json-1.9.jar
- jersey-server-1.9.jar
- jersey-test-framework-core-1.9.jar
- jersey-test-framework-grizzly2-1.9.jar
- jets3t-0.7.1.jar
- jettison-1.1.jar
- jetty-util-6.1.26.jar
- jline-0.9.94.jar
- jline-2.10.4.jar
- jodd-core-3.6.3.jar
- json4s-ast_2.10-3.2.10.jar
- json4s-core_2.10-3.2.10.jar
- json4s-jackson_2.10-3.2.10.jar
- jsr305-1.3.9.jar
- jtransforms-2.4.0.jar
- jul-to-slf4j-1.7.10.jar
- kryo-2.21.jar
- log4j-1.2.17.jar
- lz4-1.2.0.jar
- management-api-3.0.0-b012.jar
- mesos-0.21.0-shaded-protobuf.jar
- metrics-core-3.1.0.jar
- metrics-graphite-3.1.0.jar
- metrics-json-3.1.0.jar
- metrics-jvm-3.1.0.jar
- minlog-1.2.jar
- netty-3.8.0.Final.jar
- netty-all-4.0.23.Final.jar
- objenesis-1.2.jar
- opencsv-2.3.jar
- oro-2.0.8.jar
- paranamer-2.6.jar
- parquet-column-1.6.0rc3.jar
- parquet-common-1.6.0rc3.jar
- parquet-encoding-1.6.0rc3.jar
- parquet-format-2.2.0-rc1.jar
- parquet-generator-1.6.0rc3.jar
- parquet-hadoop-1.6.0rc3.jar
- parquet-jackson-1.6.0rc3.jar
- protobuf-java-2.4.1.jar
- protobuf-java-2.5.0-spark.jar
- py4j-0.8.2.1.jar
- pyrolite-2.0.1.jar
- quasiquotes_2.10-2.0.1.jar
- reflectasm-1.07-shaded.jar
- scala-compiler-2.10.4.jar
- scala-library-2.10.4.jar
- scala-reflect-2.10.4.jar
- scalap-2.10.4.jar
- scalatest_2.10-2.2.1.jar
- slf4j-api-1.7.10.jar
- slf4j-log4j12-1.7.10.jar
- snappy-java-1.1.1.7.jar
- spark-bagel_2.10-1.4.0-SNAPSHOT.jar
- spark-catalyst_2.10-1.4.0-SNAPSHOT.jar
- spark-core_2.10-1.4.0-SNAPSHOT.jar
- spark-graphx_2.10-1.4.0-SNAPSHOT.jar
- spark-launcher_2.10-1.4.0-SNAPSHOT.jar
- spark-mllib_2.10-1.4.0-SNAPSHOT.jar
- spark-network-common_2.10-1.4.0-SNAPSHOT.jar
- spark-network-shuffle_2.10-1.4.0-SNAPSHOT.jar
- spark-repl_2.10-1.4.0-SNAPSHOT.jar
- spark-sql_2.10-1.4.0-SNAPSHOT.jar
- spark-streaming_2.10-1.4.0-SNAPSHOT.jar
- spire-macros_2.10-0.7.4.jar
- spire_2.10-0.7.4.jar
- stax-api-1.0.1.jar
- stream-2.7.0.jar
- tachyon-0.5.0.jar
- tachyon-client-0.5.0.jar
- uncommons-maths-1.2.2a.jar
- unused-1.0.0.jar
- xmlenc-0.52.jar
- xz-1.0.jar
- zookeeper-3.4.5.jar

AmplabJenkins · 2015-04-21T08:36:01Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30648/
Test FAILed.

scwf · 2015-04-21T08:40:28Z

Retest this please.

SparkQA · 2015-04-21T08:42:52Z

Test build #30657 has started for PR 3753 at commit 9788b85.

SparkQA · 2015-04-21T11:03:47Z

Test build #30657 has finished for PR 3753 at commit 9788b85.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.
This patch does not change any dependencies.

AmplabJenkins · 2015-04-21T11:03:52Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30657/
Test PASSed.

scwf · 2015-04-21T11:36:33Z

core/src/main/scala/org/apache/spark/SparkHadoopWriter.scala

+  @transient protected var format: OutputFormat[AnyRef,AnyRef] = null
+  @transient protected var committer: OutputCommitter = null
+  @transient protected var jobContext: JobContext = null
+  @transient protected var taskContext: TaskAttemptContext = null


I changed the visibility of these vars/defs in SparkHadoopWriter so that the code can be reused in the ORC writing API implementation.
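
As a rough illustration of why the visibility change matters: in Scala a `private` member cannot be touched by a subclass, while a `protected` one can. The sketch below uses stand-in classes (`BaseWriter`, `OrcLikeWriter`, and their fields are hypothetical, not the real `SparkHadoopWriter` API) to show a subclass reusing the base class's state and lifecycle while swapping in its own format.

```scala
// Hypothetical stand-in for a Hadoop-style writer; not SparkHadoopWriter itself.
class BaseWriter(val jobName: String) {
  // `protected` (rather than `private`) lets subclasses read and assign these.
  @transient protected var format: String = null
  @transient protected var committed: Boolean = false

  // Subclasses can override which format gets set up in open().
  protected def defaultFormat: String = "text"

  def open(): Unit = {
    if (format == null) format = defaultFormat
  }

  def commit(): Unit = { committed = true }

  def describe: String = s"$jobName:$format:committed=$committed"
}

// The subclass reuses open/commit/describe and only overrides the format choice.
class OrcLikeWriter(jobName: String) extends BaseWriter(jobName) {
  override protected def defaultFormat: String = "orc"

  // Assigning the inherited var would not compile if it were private in BaseWriter.
  def resetFormat(): Unit = { format = null }
}

object WriterDemo {
  def main(args: Array[String]): Unit = {
    val w = new OrcLikeWriter("job-1")
    w.open()
    w.commit()
    println(w.describe) // prints "job-1:orc:committed=true"
  }
}
```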

scwf · 2015-04-21T11:40:50Z

To keep the ORC data source clean and easy to review, I will split the work into three parts, each in its own PR:
1. ORC data source API support, including the read/write implementation, without partitioning support
2. Filter push-down optimization
3. Partitioning support

This PR covers the first part.
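
For readers unfamiliar with the second item: filter push-down means handing simple predicates to the data source so it can skip data before rows reach the query engine, and column pruning means returning only the requested columns. A minimal sketch of the idea follows, using hypothetical stand-in types (`Filter`, `EqualTo`, `GreaterThan`, `InMemorySource`, and `buildScan` here are illustrative only, not Spark's or ORC's API).

```scala
// Hypothetical predicate types a planner could hand to a data source.
sealed trait Filter
case class EqualTo(column: String, value: Any) extends Filter
case class GreaterThan(column: String, value: Int) extends Filter

// A toy source holding rows as column-name -> value maps.
class InMemorySource(rows: Seq[Map[String, Any]]) {

  // Evaluate one pushed-down filter against one row.
  private def matches(row: Map[String, Any], filter: Filter): Boolean = filter match {
    case EqualTo(col, v) => row.get(col).contains(v)
    case GreaterThan(col, v) => row.get(col) match {
      case Some(i: Int) => i > v
      case _ => false
    }
  }

  // Apply the filters first, then keep only the required columns.
  def buildScan(requiredColumns: Seq[String], filters: Seq[Filter]): Seq[Seq[Any]] =
    rows
      .filter(row => filters.forall(f => matches(row, f)))
      .map(row => requiredColumns.map(c => row.getOrElse(c, null)))
}

object PushDownDemo {
  def main(args: Array[String]): Unit = {
    val source = new InMemorySource(Seq(
      Map("id" -> 1, "name" -> "a", "age" -> 20),
      Map("id" -> 2, "name" -> "b", "age" -> 35)))
    // Only the "name" column of rows with age > 30 is returned.
    val result = source.buildScan(Seq("name"), Seq(GreaterThan("age", 30)))
    println(result)
  }
}
```

A real columnar format can go further than this toy version and use file-level statistics to skip whole blocks of rows, which is why the optimization is worth a separate PR.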

/cc @marmbrus @liancheng

scwf · 2015-05-06T21:53:50Z

ping

scwf · 2015-05-17T03:52:57Z

I am closing this in favor of #6194.

This PR updates PR #6135 authored by zhzhan from Hortonworks.

----

This PR implements a Spark SQL data source for accessing ORC files.

> **NOTE**
>
> Although ORC is now an Apache TLP, the codebase is still tightly coupled with Hive. That's why the new ORC data source is under `org.apache.spark.sql.hive` package, and must be used with `HiveContext`. However, it doesn't require existing Hive installation to access ORC files.

1. Saving/loading ORC files without contacting Hive metastore
1. Support for complex data types (i.e. array, map, and struct)
1. Aware of common optimizations provided by Spark SQL:
   - Column pruning
   - Partitioning pruning
   - Filter push-down
1. Schema evolution support
1. Hive metastore table conversion

This PR also include initial work done by scwf from Huawei (PR #3753).

Author: Zhan Zhang <zhazhan@gmail.com>
Author: Cheng Lian <lian@databricks.com>

Closes #6194 from liancheng/polishing-orc and squashes the following commits:

55ecd96 [Cheng Lian] Reorganizes ORC test suites
d4afeed [Cheng Lian] Addresses comments
21ada22 [Cheng Lian] Adds @since and @experimental annotations
128bd3b [Cheng Lian] ORC filter bug fix
d734496 [Cheng Lian] Polishes the ORC data source
2650a42 [Zhan Zhang] resolve review comments
3c9038e [Zhan Zhang] resolve review comments
7b3c7c5 [Zhan Zhang] save mode fix
f95abfd [Zhan Zhang] reuse test suite
7cc2c64 [Zhan Zhang] predicate fix
4e61c16 [Zhan Zhang] minor change
305418c [Zhan Zhang] orc data source support

(cherry picked from commit aa31e43)
Signed-off-by: Michael Armbrust <michael@databricks.com>

scwf mentioned this pull request Dec 21, 2014

[WIP][SPARK-2883][SQL]initial support ORC in spark sql #2576

Closed

Orc support through datasource api

f2c246f

scwf force-pushed the orc-datasourceapi branch from 1d3dce3 to f2c246f Compare February 14, 2015 02:38

style fix

f21b693

scwf force-pushed the orc-datasourceapi branch from 9d7c082 to f21b693 Compare February 14, 2015 02:54

fix style

956c095

Merge branch 'master' of https://github.com/apache/spark into orc-datasourceapi

d96b657

fix compile error

9788b85

scwf force-pushed the orc-datasourceapi branch from 0dd36ee to 9788b85 Compare April 21, 2015 06:32

scwf reviewed Apr 21, 2015

scwf mentioned this pull request May 14, 2015

[SPARK-2883][SQL] Spark Support for ORCFile with New Framework #6135

Closed

liancheng mentioned this pull request May 15, 2015

[SPARK-2883] [SQL] ORC data source for Spark SQL #6194

Closed

scwf closed this May 17, 2015

scwf deleted the orc-datasourceapi branch May 17, 2015 03:53
