Skip to content

This project implements a distributed dataflow platform for processing large amount (big-data) of key-value pairs, where keys and values are integers.

License

Notifications You must be signed in to change notification settings

AlessandroConti11/Fault-Tolerant_Data_Flow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fault-Tolerant Dataflow Platform

Author:

License: MIT license

Tags: #Apache-Flink, #big-data, #computer_engineering, #distributed_system, #fault-tollerance, #java, #map_reduce, #protobuf, #polimi.

University

Politecnico di Milano.

Academic Year: 2023/2024.

090950 - Distributed Systems - professor Cugola Giampaolo Saverio - optional project.

Specification

Specification overview:

Implement a distributed dataflow platform for processing large amount (big-data) of key-value pairs, where keys and values are integers.
The platform includes a coordinator and multiple workers running on multiple nodes of a distributed system. The coordinator accepts dataflow programs specified as an arbitrarily long sequence of the above operators.

Full specification are in the Specification/projects_2023-2024

How to run

The steps specified below are suitable for a Unix environment.

  1. set environment variables in the .env file
    • INET_IFACE
    • FAULTY_THREADS
    • FAULTY_THREADS_SECS_INTERVAL
    • FAULT_PROBABILITY
  2. compile the proto message
    ./run proto
  3. run the allocator
    • allocates the WorkerManagers
    • allocates the Coordinator
    ./run alloc
  4. run the client
    ./run client ADDRESS_COORDINATOR_ALLOC OTHER_ALLOC_ADDRESSES
    Optionally, it is possible to specify the operations and files to be executed.

Final Consideration

Final Evaluation: 4/4

About

This project implements a distributed dataflow platform for processing large amount (big-data) of key-value pairs, where keys and values are integers.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •  

Languages