Subject: Hierarchical, Edge-Aware Reinforcement Learning for Serverless IoT Resource Optimization.
This framework implements a novel Reinforcement Learning (RL) Optimizer for Serverless Spark workloads processing High-Velocity IoT data. Unlike traditional auto-scalers (HPA) or static heuristics, this system uses a Contextual PPO Agent to jointly optimize (a sketch of the joint decision space follows this list):
- Compute: Number of Executors & Memory per Executor.
- Storage: Shuffle Tier Selection (Redis vs NVMe vs S3) & Compression.
- Proactivity: Reacts to Edge Signals (Burst Prediction) before load arrives.
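A minimal sketch of how this joint decision space could be expressed as a Gymnasium environment for the PPO agent. Only the 13-dimensional state comes from the framework; the action bounds and placeholder dynamics below are illustrative assumptions, not the actual interface:

```python
import numpy as np
import gymnasium as gym
from gymnasium import spaces


class ServerlessSparkEnv(gym.Env):
    """Skeleton of the joint compute + shuffle decision space (illustrative only)."""

    def __init__(self):
        super().__init__()
        # 13-dimensional state (Dim=13): workload, cluster, and edge burst-prediction features.
        self.observation_space = spaces.Box(
            low=-np.inf, high=np.inf, shape=(13,), dtype=np.float32
        )
        # Joint action: [num_executors, memory_per_executor, shuffle_tier, compression_level].
        # The bounds are placeholders, not the framework's actual ranges.
        self.action_space = spaces.MultiDiscrete([32, 8, 3, 4])

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        return np.zeros(13, dtype=np.float32), {}

    def step(self, action):
        obs = np.zeros(13, dtype=np.float32)  # placeholder next state
        reward = 0.0                          # placeholder: cost/latency/SLA-shaped reward
        terminated, truncated = False, False
        return obs, reward, terminated, truncated, {}
```

Such an environment wraps directly into Stable-Baselines3, e.g. `PPO("MlpPolicy", ServerlessSparkEnv()).learn(total_timesteps=100_000)`.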
- Problem: Fixed reward weights ($\alpha$, $\beta$, $\gamma$) fail when workload priorities shift (e.g., Budget vs Critical Mode).
- Solution: A Meta-Controller (TD3) dynamically tunes the lower-level agent's reward function based on the current `IoT_Scenario` (a reward-shaping sketch follows this list).
- Report: Meta-Controller Report
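A minimal sketch of the hierarchical coupling, assuming the lower-level reward is a weighted sum of cost, latency, and SLA-violation terms whose weights the meta-controller emits per scenario; the scenario names and numeric weights here are made up for illustration:

```python
import numpy as np


def lower_level_reward(cost, latency, sla_violation, weights):
    """Reward for the PPO resource agent, shaped by meta-controller weights (alpha, beta, gamma)."""
    alpha, beta, gamma = weights
    return -(alpha * cost + beta * latency + gamma * sla_violation)


# Stand-in for the TD3 meta-controller: map the current IoT scenario to reward weights.
# In the framework this mapping is learned; the values below are illustrative only.
SCENARIO_WEIGHTS = {
    "BUDGET":   np.array([0.7, 0.2, 0.1]),  # favour low cost
    "CRITICAL": np.array([0.1, 0.3, 0.6]),  # favour SLA compliance
}

weights = SCENARIO_WEIGHTS["CRITICAL"]
print(lower_level_reward(cost=0.42, latency=1.8, sla_violation=0.0, weights=weights))
```

Because the weight vector is continuous, a continuous-action algorithm such as TD3 can adjust it smoothly rather than switching among a few fixed profiles.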
- Problem: Shuffle cost/latency is a bottleneck. "One size fits all" storage fails for variable data temperatures.
- Solution: The RL Agent selects the optimal Storage Tier (HOT/WARM/COLD) and Compression Level based on `Data_Temperature` (Reuse Probability); see the decoding sketch after this list.
- Report: Adaptive Shuffle Report
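A minimal sketch of how a tier/compression action could be decoded against the Redis/NVMe/S3 shuffle backends; the temperature thresholds and compression levels are assumptions for illustration:

```python
from dataclasses import dataclass

# HOT / WARM / COLD tiers map onto the Redis / NVMe / S3 shuffle backends.
TIERS = {0: ("HOT", "redis"), 1: ("WARM", "nvme"), 2: ("COLD", "s3")}


@dataclass
class ShuffleConfig:
    tier: str
    backend: str
    compression_level: int  # higher = smaller shuffle files, more CPU


def decode_shuffle_action(tier_idx: int, data_temperature: float) -> ShuffleConfig:
    """Decode the agent's tier choice; compress cold (low-reuse) data more aggressively."""
    tier, backend = TIERS[tier_idx]
    if data_temperature > 0.7:    # hot data: likely reused soon, keep compression cheap
        level = 1
    elif data_temperature > 0.3:  # warm data
        level = 3
    else:                         # cold data: rarely reused, optimise for size
        level = 9
    return ShuffleConfig(tier=tier, backend=backend, compression_level=level)


print(decode_shuffle_action(tier_idx=2, data_temperature=0.1))
```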
- Problem: Cloud auto-scaling is reactive (lags behind bursts).
- Solution: Ingests `Burst_Prediction` signals from Edge Gateways into the RL State Vector (Dim=13), enabling Proactive Scaling (see the state-assembly sketch after this list).
- Report: Edge-Cloud Report
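A minimal sketch of assembling the 13-dimensional state vector with the edge burst-prediction signal appended. Only the dimensionality comes from the framework; the individual feature names are assumptions:

```python
import numpy as np


def build_state(cluster_metrics: dict, edge_signal: dict) -> np.ndarray:
    """Pack cluster metrics and the edge burst prediction into the 13-dim RL state."""
    features = [
        cluster_metrics["input_rate"],        # events/sec currently arriving
        cluster_metrics["queue_depth"],
        cluster_metrics["executor_count"],
        cluster_metrics["avg_cpu"],
        cluster_metrics["avg_memory"],
        cluster_metrics["shuffle_read_mb"],
        cluster_metrics["shuffle_write_mb"],
        cluster_metrics["p95_latency"],
        cluster_metrics["cost_rate"],
        cluster_metrics["data_temperature"],
        edge_signal["burst_probability"],     # edge gateway: probability of an imminent burst
        edge_signal["predicted_peak_rate"],   # edge gateway: predicted peak events/sec
        edge_signal["time_to_burst"],         # edge gateway: seconds until the burst
    ]
    return np.asarray(features, dtype=np.float32)
```

Because the last three features arrive before the load does, the agent can provision executors ahead of a spike instead of reacting after latency has already degraded.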
IoT Devices → Edge Processing (Burst Pred) → MQTT/Kafka (Ingestion) → Serverless Spark (RL Optimizer, 4-Tier Shuffle) → Output (Dashboard)
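At the ingestion boundary, the Spark side of this pipeline would consume the Kafka stream with Structured Streaming. A minimal sketch, where the broker address, topic name, and console sink are placeholders rather than the framework's actual configuration:

```python
from pyspark.sql import SparkSession

# Requires the Spark Kafka connector package on the classpath.
spark = SparkSession.builder.appName("iot-stream").getOrCreate()

# Read the ingestion topic; broker address and topic name are placeholders.
events = (
    spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "localhost:9092")
    .option("subscribe", "iot-events")
    .load()
    .selectExpr("CAST(value AS STRING) AS payload", "timestamp")
)

# Console sink stands in for the framework's output/dashboard stage.
query = (
    events.writeStream
    .format("console")
    .outputMode("append")
    .start()
)
query.awaitTermination()
```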
- Processing: Apache Spark 3.5 (Structured Streaming)
- RL Framework: Stable-Baselines3 (PPO, TD3), Gymnasium
- Cloud/Infra: Docker, Kubernetes (Simulated for Thesis)
- State Store: Redis (Hot), NVMe (Warm), S3 (Cold)
- Python 3.10+
- Java 11+
- Docker (Optional for full stack)
```bash
git clone https://github.com/manvith2003/serverless-spark-iot-framework.git
cd serverless-spark-iot-framework
pip install -r requirements.txt
```

Train the RL Agent:

```bash
python optimization/resource_allocation/ppo_agent.py --train --timesteps 100000
```

Run Benchmark Tournament (Evaluation):

```bash
python benchmarks/run_eval.py
```

Output: Generates `benchmarks/results_summary.csv` and `benchmark_plots.png`.
We conducted a head-to-head tournament over a stochastic 500-step trace.
| Policy | Description | SLA Violations | Verdict |
|---|---|---|---|
| Fixed | Static 15 Executors | 90.0% | ❌ Failed (Overloaded) |
| Dexter | Standard HPA (Reactive) | 54.4% | ❌ Too slow for IoT Bursts |
| Seer | Linear Predictive | 100.0% | ❌ Under-provisioned Shuffle |
| Edge-Aware RL | Proposed System | 19.2% | ✅ Superior Stability |
Thesis Conclusion: The Edge-Aware Contextual RL agent incurs a moderate cost premium (~9%) to achieve a roughly 3x improvement in Service Reliability (19.2% SLA violations vs 54.4% for the strongest baseline).
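The SLA Violations column above is a per-step rate over the trace. A minimal sketch of how such a rate can be computed; the latency threshold and the synthetic trace below are assumptions for illustration:

```python
import numpy as np


def sla_violation_rate(latencies_ms, sla_target_ms=500.0):
    """Fraction of trace steps whose end-to-end latency exceeded the SLA target."""
    latencies = np.asarray(latencies_ms, dtype=float)
    return float((latencies > sla_target_ms).mean())


# Synthetic 500-step latency trace, purely for illustration.
rng = np.random.default_rng(0)
trace = rng.gamma(shape=2.0, scale=200.0, size=500)
print(f"SLA violations: {sla_violation_rate(trace):.1%}")
```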
This work is part of a research project at Indian Institute of Information Technology Kottayam.
Author: Manvith M
Advisor: Dr. Shajulin Benedict
Contact: manvith131250@gmail.com
.
├── benchmarks/ # Evaluation Scripts & Policies
│ ├── policies.py # Baseline implementations
│ └── run_eval.py # Tournament runner
├── configs/ # Configuration files
├── optimization/ # Core RL Logic
│ ├── meta_controller/ # TD3 Meta-Agent
│ └── resource_allocation/ # PPO Resource Agent
├── spark_core/ # Spark Integration
└── reports/ # Generated Technical Reports
MIT License.