Docker for MongoDB and Apache Spark.

An example docker-compose setup that runs a single Apache Spark node connecting to MongoDB via the MongoDB Spark Connector.

For demo purposes only.

Starting up

Start by running the command:

docker-compose run spark bash

This starts both the Spark node and the MongoDB node, and gives you a bash shell on the Spark container.

From the Spark instance, you can reach the MongoDB instance using the mongodb hostname.

A small example dataset is provided at /home/ubuntu/times.json, which you can load into MongoDB using initDocuments.scala:

${SPARK_HOME}/bin/spark-shell --conf "spark.mongodb.input.uri=mongodb://mongodb:27017/spark.times" --conf "spark.mongodb.output.uri=mongodb://mongodb/spark.output" --packages org.mongodb.spark:mongo-spark-connector_${SCALA_VERSION}:${MONGO_SPARK_VERSION} -i ./initDocuments.scala
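The actual contents of initDocuments.scala live in the repository; as a rough sketch of the pattern, assuming times.json is line-delimited JSON (the collection name and calls below are illustrative, not the script's exact code):

// Minimal sketch, not the repository's actual initDocuments.scala.
import com.mongodb.spark._
import com.mongodb.spark.config._

// spark-shell provides the spark (SparkSession) and sc (SparkContext) values.
val times = spark.read.json("/home/ubuntu/times.json")

// Write into the spark.times collection referenced by spark.mongodb.input.uri,
// overriding the default output collection (spark.output) for this one save.
val writeConfig = WriteConfig(Map("collection" -> "times"), Some(WriteConfig(sc)))
MongoSpark.save(times, writeConfig)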

For a complete example, see reduceByKey.scala, which queries MongoDB, runs a simple aggregation, and writes the output back to MongoDB. The file is also available inside the Spark container at /home/ubuntu/reduceByKey.scala.
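The script's exact code is in the repository; the overall pattern it follows (load from MongoDB, aggregate with reduceByKey, save the result) could look roughly like the sketch below, where the field name "name" is a placeholder for whatever field the dataset actually contains:

// Illustrative sketch only; see /home/ubuntu/reduceByKey.scala for the real code.
import com.mongodb.spark._
import org.bson.Document

// Load spark.times (from spark.mongodb.input.uri) as an RDD of BSON Documents.
val rdd = MongoSpark.load(sc)

// Count documents per "name" field ("name" is a hypothetical field name).
val counts = rdd
  .map(doc => (doc.getString("name"), 1))
  .reduceByKey(_ + _)

// Wrap the results in Documents and save them to spark.output
// (taken from spark.mongodb.output.uri).
val results = counts.map { case (name, count) =>
  new Document("name", name).append("count", Int.box(count))
}
MongoSpark.save(results)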

Start an interactive Spark shell by executing:

${SPARK_HOME}/bin/spark-shell --conf "spark.mongodb.input.uri=mongodb://mongodb:27017/spark.times" --conf "spark.mongodb.output.uri=mongodb://mongodb/spark.output" --packages org.mongodb.spark:mongo-spark-connector_${SCALA_VERSION}:${MONGO_SPARK_VERSION}

You can also append -i <file.scala> to execute a Scala file via the Spark shell.
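For example, to run the aggregation example non-interactively:

${SPARK_HOME}/bin/spark-shell --conf "spark.mongodb.input.uri=mongodb://mongodb:27017/spark.times" --conf "spark.mongodb.output.uri=mongodb://mongodb/spark.output" --packages org.mongodb.spark:mongo-spark-connector_${SCALA_VERSION}:${MONGO_SPARK_VERSION} -i ./reduceByKey.scala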

More Information

See the related article:
