running apache spark with docker swarm
-
Updated
Feb 25, 2021 - Dockerfile
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
running apache spark with docker swarm
PySpark in Docker Containers
Arquitectura en contenedores para Plataformas GIS: Dockerfiles y scripts para construir y desplegar un entorno reproducible de geoprocesamiento en Python, Apache Spark y Sedona
Docker setup for Apache Spark and the R sparklyr package
Collection of Apache Spark docker images for OKDP
Dockerimage of morpheus, the project from opencypher previously known as Cypher for Apache Spark
This repository holds examples and documentation about the most used tools in the data engineering ecosystem.
Small setup of development environment for Apache Spark with docker
Containers configuration saved from other tasks related to work or personal projects
Apache Spark cluster connected to a Jupyter Notebook instance
Setting up a simple Apache Spark environment used for working with Spark in a development environment.
A robust, scalable on-premises data lake
Set-up apache spark cluster with hadoop(hdfs) and airflow on docker
Pipeline de engenharia de dados ponta a ponta utilizando Arquitetura Medalhão com Spark (R), Airflow e BigQuery para análise de e-commerce brasileiro.
Lightweight Docker image for running Apache Spark with Scala 2.13.
Created by Matei Zaharia
Released May 26, 2014