MapReduce, Spark, Java, and Scala for Data Algorithms Book
-
Updated
Oct 14, 2024 - Java
MapReduce, Spark, Java, and Scala for Data Algorithms Book
hadoop-cos(CosN文件系统)为Apache Hadoop、Spark以及Tez等大数据计算框架集成提供支持,可以像访问HDFS一样读写存储在腾讯云COS上的数据。同时也支持作为Druid等查询与分析引擎的Deep Storage
Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.
Set of Input Formats for Hadoop Streaming
The source code developed and used for the purposes of my thesis with the same title under the guidance of my supervisor professor Vasilis Mamalis for the Department of Informatics and Computer Engineering of the University of West Attica.
A small code to validate the Census data on the basis of Aadhar Data
logback appender for apache-flume
This project analyzes one month of NYC Yellow Taxi trip data (January 2016) to identify the busiest taxi pickup locations. It utilizes the Hadoop MapReduce framework to process the data and a lookup table to map location IDs to human-readable zone names.
COVID-19 data analysis with MapReduce
Java code for Apriori algorithm using MapReduce
Programs conducted at Army Institute of Technology, Pune in training on Big Data Analytics during September 2024.
Applying MapReduce in Java on a Twitter dataset using Apache Hadoop
Map reducing task with apache hadoop.
In this project we will use Hadoop MapReduce to implement a very basic “Sentiment Analysis” using the review text in the Yelp Academic Dataset as training data.
Problem of word count done using Apache Hadoop
AWS Cloudera Hadoop setup with H2O, Spark, MR
Apache Hadoop. Apache Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. Originally designed for co…
Apache Hadoop – A course for undergraduates | along with Apache Pig and Hive
Add a description, image, and links to the apache-hadoop topic page so that developers can more easily learn about it.
To associate your repository with the apache-hadoop topic, visit your repo's landing page and select "manage topics."