Toy Hadoop cluster combining various SQL-on-Hadoop variants
-
Updated
Nov 16, 2017 - Shell
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
A storage reference to a comprehensive guide on installing Hadoop on Windows
Word Count and Pair Count in Text with Spark & Hadoop
Sorting of large dataset files(80GB) using Hadoop(Mapreduce) techniques and Apache Spark in Java and scheduled job on the virtual cluster(using 4 nodes) using a SLURM scheduler with bash scripting
🐘Yet another Hadoop playground
This repository contains the H1B_Visa Applicants Data Analysis project/case study using Hadoop undertaken during the training at NIIT. MapReduce,Hive,Pig,Scoop and Shell-scripting are the technologies used.
EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5
Apache Hadoop docker image | Running Python MapReduce
Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.
Virtual Machine with Hadoop environment setup and ready to run map-reduce applications
Hadoop Hive practice
Hadoop Ansible Test Suite
Add a description, image, and links to the hadoop-mapreduce topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-mapreduce topic, visit your repo's landing page and select "manage topics."