
Support Spark 1.3 #384

Merged
merged 11 commits into master from spark_1.3 on Mar 16, 2015

Conversation

Leemoonsoo
Contributor

Spark 1.3 is released.
This PR makes Zeppelin work with Spark 1.3.

  • Add profile
  • Make Zeppelin build with Spark 1.3
  • Take care of SchemaRDD -> DataFrame (see the sketch after this list)
  • Test on cluster environment
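
For context, a hedged sketch of what the SchemaRDD -> DataFrame change looks like at the API level (not the actual Zeppelin diff; the sqlContext and the bank table name are just example assumptions):

// Spark 1.2.x: sqlContext.sql() returned a SchemaRDD
// val result: org.apache.spark.sql.SchemaRDD = sqlContext.sql("select * from bank")

// Spark 1.3.0: the same call returns a DataFrame (SchemaRDD is replaced)
val result: org.apache.spark.sql.DataFrame = sqlContext.sql("select * from bank")
result.take(10).foreach(println)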

@swkimme
Contributor

swkimme commented Mar 13, 2015

You're moving so fast!

@Leemoonsoo
Contributor Author

Ready to be merged!

[image: https://cloud.githubusercontent.com/assets/1540981/6649957/bda0b1d4-ca3d-11e4-908e-da6ad1bd172d.png]

Note that the implicit conversion from RDD -> DataFrame is not working,
i.e. the following code fails:

case class Person(name:String)
val person = sc.parallelize(List(Person("hello"), Person("world")))
person.registerTempTable("person")  // fails

The same problem exists in spark-shell, too.
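
A hedged sketch of the explicit workaround that comes up later in this thread (assumes sqlContext and its implicits are in scope, as the Spark 1.3 shell provides):

// Explicitly convert the RDD with toDF before registering, instead of
// relying on the implicit RDD -> DataFrame conversion that fails here.
case class Person(name: String)
val person = sc.parallelize(List(Person("hello"), Person("world")))

// import sqlContext.implicits._  // needed if not already imported by the shell
person.toDF().registerTempTable("person")

sqlContext.sql("select name from person").collect().foreach(println)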

@swkimme
Contributor

swkimme commented Mar 14, 2015

Great demo! +1 for merge.


@syepes

syepes commented Mar 14, 2015

+1, I will be test driving 1.3 this week.

@Leemoonsoo
Contributor Author

According to https://spark.apache.org/docs/latest/sql-programming-guide.html#starting-point-sqlcontext, HiveContext is preferable to the plain SQLContext.

I pushed one more change: if HiveContext is available (i.e. the Hive-related dependencies are loaded), use it instead of SQLContext.
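
A minimal sketch of that fallback idea (not the actual Zeppelin code; the helper name simply mirrors Spark's createSQLContext): construct a HiveContext reflectively when the Hive classes are on the classpath, otherwise fall back to a plain SQLContext.

import org.apache.spark.SparkContext
import org.apache.spark.sql.SQLContext

// Sketch: use HiveContext when the hive-related dependencies are loaded,
// otherwise fall back to SQLContext.
def createSQLContext(sc: SparkContext): SQLContext =
  try {
    Class.forName("org.apache.spark.sql.hive.HiveContext")
      .getConstructor(classOf[SparkContext])
      .newInstance(sc)
      .asInstanceOf[SQLContext]
  } catch {
    case _: Throwable => new SQLContext(sc)
  }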

@syepes

syepes commented Mar 14, 2015

@Leemoonsoo have you tried:

person.toDF.registerTempTable("person") 

@syepes

syepes commented Mar 14, 2015

@felixcheung I am with you; it would be better to leave the choice up to the user. I personally use the CassandraSQLContext.

@Leemoonsoo
Contributor Author

@syepes Thanks for letting me know a way to registerTempTable.

https://github.com/apache/spark/blob/v1.3.0/repl/scala-2.10/src/main/scala/org/apache/spark/repl/SparkILoop.scala#L1022
Spark's createSQLContext() API always tries to create a HiveContext and falls back to SQLContext when that fails,
so I tried to do the same in Zeppelin. But I agree on giving the user an option.

Then, @felixcheung, @syepes, how about bringing the zeppelin.spark.useHiveContext property back with a default value of 'true'? It was just removed in this pull request; previously, the default value was 'false'.
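
A hedged sketch of how that property could gate the choice (illustrative only: Zeppelin reads its interpreter properties rather than system properties, and sc plus the createSQLContext helper sketched above are assumed to be in scope):

// Illustrative only: pick the context based on zeppelin.spark.useHiveContext,
// defaulting to 'true' as proposed here.
val useHiveContext =
  sys.props.getOrElse("zeppelin.spark.useHiveContext", "true").toBoolean

val sqlc: SQLContext =
  if (useHiveContext) createSQLContext(sc)  // HiveContext when available
  else new SQLContext(sc)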

@syepes

syepes commented Mar 15, 2015

@Leemoonsoo No problem, using useHiveContext is a good alternative.
I will be updating my fork, which has the useCassandraSqlContext option, with your changes.

Thanks for the work on 1.3

@felixcheung
Contributor

@Leemoonsoo sounds good to me too.

@geekflyer

@Leemoonsoo I've tried to execute your exact same example; however, it appears the df val is not passed along to the SQL editor and I get the message no such table List(df).

I'm running Spark standalone with 2 workers, Spark version 1.3.0, on Ubuntu 14.04 LTS.
Zeppelin was built using mvn clean package -Pspark-1.3 -Dhadoop.version=2.2.0 -Phadoop-2.2 -DskipTests from the spark_1.3 branch.

Any idea what causes the problem?

image

@Leemoonsoo
Contributor Author

@geekflyer
Thanks for trying this branch. I missed one statement in my screenshot. You need to register the DataFrame as a table before running a SQL query, like

df.toDF.registerTempTable("df")

@Leemoonsoo
Contributor Author

The zeppelin.spark.useHiveContext property is restored with a default value of 'true'.
I'll merge if there are no more issues on this branch!

@swkimme
Contributor

swkimme commented Mar 16, 2015

+1 for merge

Leemoonsoo added a commit that referenced this pull request Mar 16, 2015
Leemoonsoo merged commit c84347d into master on Mar 16, 2015
Leemoonsoo deleted the spark_1.3 branch on March 16, 2015 08:18
@geekflyer

@Leemoonsoo Thanks for your help. Now it works completely fine :-)
