#

hadoop-mapreduce

Here are 25 public repositories matching this topic...

waltherg / distributable_docker_sql_on_hadoop

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Updated Nov 16, 2017
Shell

hyeonsangjeon / dataplatform

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

hive hadoop hadoop-cluster hadoop-mapreduce hadoop-docker pyspark-notebook zeppelin-notebook hadoop-ecosystem

Updated Nov 7, 2019
Shell

Shwetabhdixit / Hadoop-2.7.3-Installation-Guide-for_windows

A storage reference to a comprehensive guide on installing Hadoop on Windows

hadoop-cluster hadoop-mapreduce hadoop-framework

Updated Jun 11, 2018
Shell

AliAhmadi-Software / word-count-pair-count-Hadoop-Spark

Word Count and Pair Count in Text with Spark & Hadoop

spark hadoop hadoop-mapreduce

Updated Nov 21, 2024
Shell

Sabareesh19 / Sort-on-Hadoop-Spark

Sorting of large dataset files(80GB) using Hadoop(Mapreduce) techniques and Apache Spark in Java and scheduled job on the virtual cluster(using 4 nodes) using a SLURM scheduler with bash scripting

java linux spark virtual-machine slurm-job bash-script rdd hadoop-mapreduce virtual-clusters

Updated May 4, 2018
Shell

arkady-emelyanov / hadoop-playground

🐘Yet another Hadoop playground

hadoop hadoop-mapreduce yarn-hadoop-cluster hadoop-hdfs

Updated May 28, 2018
Shell

NikhilURao / H1B_VisaProject

This repository contains the H1B_Visa Applicants Data Analysis project/case study using Hadoop undertaken during the training at NIIT. MapReduce,Hive,Pig,Scoop and Shell-scripting are the technologies used.

mysql hadoop bigdata apache shell-script sqoop hadoop-filesystem hadoop-mapreduce apache-pig apache-hive

Updated Jun 26, 2019
Shell

alex-ber / docker-hive

EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5

Updated Jan 6, 2020
Shell

arminZolfaghari / docker-hadoop

Apache Hadoop docker image | Running Python MapReduce

hadoop hadoop-mapreduce docker-hadoop hadoop-hdfs mapreduce-python

Updated May 28, 2023
Shell

divithraju / divith-raju-Hadoop-3.3.6-setup-on-Ubuntu

linux shell yarn hadoop ubuntu clustering bigdata apache hdfs shell-script clustering-algorithm dataplatform hadoop-mapreduce dataengineering

Updated Mar 31, 2024
Shell

gmarciani / mapreduce-app

Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.

bigdata mapreduce scaffolding batch-processing hadoop-mapreduce

Updated Jun 19, 2017
Shell

lucasmior / hadoop-vm

Virtual Machine with Hadoop environment setup and ready to run map-reduce applications

vagrant hadoop vagrant-environments hadoop-mapreduce hadoop-hdfs

Updated Jan 22, 2020
Shell

kenten132 / hadoop-Sandbox

Testing and learning Hadoop.

java hadoop-mapreduce

Updated Jun 15, 2017
Shell

darule0 / yarndiff

A rudimentary command line utility for contrasting Apache Yarn container logs.

diff spark yarn hive hadoop log4j pig mapreduce diffing difference hadoop-mapreduce yarn2

Updated Jan 8, 2024
Shell

xuyinhao / lgpbenchmark

公司内部对接Hadoop 基本正确性测试

hadoop hadoop-mapreduce hadoop-hdfs

Updated Jun 23, 2020
Shell

aaa121 / Big-Data-Analytics

python r scala sql hive pyspark pig sparkr hadoop-filesystem hadoop-mapreduce

Updated Jul 22, 2017
Shell

m-anshu / big-data-coursework

Big Data coursework material

big-data hadoop-mapreduce

Updated Oct 17, 2024
Shell

s-evsyukov / hadoop_hive

Hadoop Hive practice

yarn aws-s3 hadoop-mapreduce hadoop-hdfs hadoop-hive hive-sql

Updated Aug 9, 2022
Shell

lhuaquisto / hadoop-multicluster

spark ubuntu virtual-machine hadoop-mapreduce

Updated Aug 27, 2019
Shell

groda / hats

Hadoop Ansible Test Suite

ansible hadoop smoke-tests test-automation hadoop-spark smoke-test hadoop-mapreduce hadoop-hdfs hadoop-yarn

Updated Feb 13, 2025
Shell

Improve this page

Add a description, image, and links to the hadoop-mapreduce topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hadoop-mapreduce topic, visit your repo's landing page and select "manage topics."