ReSimHub is a scalable, research-grade backend framework designed for reinforcement learning (RL) experimentation, simulation, and benchmarking.
It provides RESTful and asynchronous APIs for managing simulation environments, training orchestration, and agent evaluation — all optimised for distributed systems and reproducible research.
Built for modern RL pipelines — where experimentation, asynchronous training, and performance evaluation converge.
- Hybrid Flask–FastAPI Framework — Combines Flask’s flexibility with FastAPI’s async capabilities (see the sketch after this list).
- Experiment Management APIs — Create, register, and manage experiments programmatically.
- Distributed Orchestration — Scalable Celery + Redis job queues for RL training workloads.
- Data Processing Layer — NumPy/Pandas-powered analytics for logs, metrics, and benchmarking.
- Unified API Gateway — Seamless bridge between Flask and FastAPI services.
- Evaluation & Benchmarking APIs — Compare and score RL agents using consistent metrics.
- Observability Stack — Prometheus and Grafana integration for monitoring.
- Containerised Deployment — Docker- and Kubernetes-ready for research clusters.
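As a minimal illustration of the hybrid pattern referenced above (a sketch, not ReSimHub's actual wiring): FastAPI serves the async API and mounts a Flask app for synchronous routes under a sub-path.

```python
# Sketch of the hybrid Flask-FastAPI pattern (illustrative only):
# FastAPI handles async routes and mounts a Flask app for sync routes.
from fastapi import FastAPI
from fastapi.middleware.wsgi import WSGIMiddleware
from flask import Flask, jsonify

flask_app = Flask(__name__)

@flask_app.route("/health")
def health():
    return jsonify(status="ok")

app = FastAPI()

@app.get("/ping")
async def ping():
    return {"pong": True}

# Flask routes become reachable under /legacy/* through the ASGI app
app.mount("/legacy", WSGIMiddleware(flask_app))
```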
| Stage | Focus Area | Objective |
|---|---|---|
| Stage 1 | Project Bootstrap | Initialise structure, dependencies, hybrid Flask–FastAPI framework, and CI pipeline. |
| Stage 2 | Core Experimentation APIs | Create experiment management, environment registration, and metadata models. |
| Stage 3 | Async Orchestration | Integrate Celery + Redis for distributed training tasks. |
| Stage 4 | Data Processing Layer | Add NumPy/Pandas-powered services for results and benchmarking. |
| Stage 5 | Flask–FastAPI Bridge | Implement communication bridge and unified API gateway. |
| Stage 6 | Evaluation & Benchmarking APIs | Develop endpoints for agent evaluation and comparative benchmarking. |
| Stage 7 | Observability & Persistence | Integrate DB persistence, monitoring, and structured logging. |
| Stage 8 | End-to-End Test & Deployment | Containerise, test, and deploy with Docker/Kubernetes. |
┌──────────────────────────────┐
│ REST Clients │
└─────────────┬────────────────┘
│
│
┌───────────────────────────┼───────────────────────────────┐
│ │ │
│ │ │
┌───────▼────────┐ ┌─────────▼────────┐ ┌─────────▼─────────┐
│ Flask API │ ◄────► │ FastAPI Core │ ◄─────────►│ Benchmarking API │
│ (Legacy/Sync) │ │ (Async Gateway) │ │ (Model Eval Layer)│
└───────┬────────┘ └─────────┬────────┘ └─────────┬─────────┘
│ │ │
│ │ │
┌───────▼────────┐ ┌────────▼────────┐ ┌────────▼─────────┐
│ Celery Workers │◄────────►│ Evaluation Queue│◄────────────│ Model Uploads │
│ (Distributed) │ │ (Redis MQ) │ │ & Orchestration │
└───────┬────────┘ └─────────────────┘ └──────────────────┘
│
│
┌─────▼──────┐ ┌───────────────┐ ┌────────────────────┐
│ PostgreSQL │◄────────────►│ Redis Cache │◄──────►│ Prometheus/Grafana │
└────────────┘ └───────────────┘ └────────────────────┘
┌────────────────────────────────────────────────────┐
│ REST Clients │
│ (curl, Postman, Frontend, Notebooks) │
└───────────────────────┬────────────────────────────┘
│
▼
┌──────────────────────────────┐
│ FastAPI Service │
│ (Async REST + Training API) │
└──────────────┬───────────────┘
│
▼
┌──────────────────────────────────────────┐
│ Benchmarking API │
│ (Model Uploads + Eval Orchestration) │
└───────────────────┬──────────────────────┘
│
▼
┌────────────────────────────────┐
│ Celery Workers │
│ (Distributed RL orchestration) │
└───────────────┬────────────────┘
│
▼
┌───────────────────────────────┐
│ Redis MQ │
│ (Job Queue + Task Results) │
└───────────────────────────────┘
│
┌────────────────────┐ │ ┌────────────────────┐
│ Flask Service │◄───────────────┘────────────────►│ PostgreSQL DB │
│ (Frontend Gateway) │ │ │ (Experiment Store) │
└────────────────────┘ │ └────────────────────┘
│
▼
┌───────────────────────────────┐
│ Prometheus / Grafana Stack │
│ (Observability & Metrics) │
└───────────────────────────────┘
Prerequisites:

- Python 3.10+
- Docker & Docker Compose (optional)
- Redis & PostgreSQL instances (local or containerised)
Clone and install:

```bash
git clone https://github.com/imosudi/ReSimHub.git
cd ReSimHub
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```

Create a `.env` file in the project root:

```ini
# Flask & FastAPI
APP_ENV=development
SECRET_KEY=changeme
# Database
DATABASE_URL=postgresql://user:password@localhost:5432/resimhub
# Redis & Celery
REDIS_URL=redis://localhost:6379/0
CELERY_BROKER_URL=${REDIS_URL}
CELERY_RESULT_BACKEND=${REDIS_URL}
```
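At startup the services read these settings from the environment; a minimal sketch, assuming python-dotenv (or similar) is among the dependencies:

```python
# Sketch: load .env at startup (assumes the python-dotenv package).
import os

from dotenv import load_dotenv

load_dotenv()  # reads .env from the project root; ${VAR} references are expanded

DATABASE_URL = os.environ["DATABASE_URL"]
REDIS_URL = os.getenv("REDIS_URL", "redis://localhost:6379/0")
```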
Run the services, in separate terminals:

```bash
python backend/main.py
uvicorn backend.fastapi_app.main:app --reload --port 8000
```

or, with Docker:
```bash
docker-compose up --build
```

Start the Celery worker:

```bash
celery -A backend.fastapi_app.services.orchestrator.celery_app worker --loglevel=info
```
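For orientation, a training task consumed by this worker might be shaped as follows. A sketch only: the real celery_app lives in backend/fastapi_app/services/orchestrator.py (per the worker command above), and the task name and body here are hypothetical.

```python
# Sketch of a Celery training task; name and body are hypothetical.
from celery import Celery

celery_app = Celery(
    "resimhub",
    broker="redis://localhost:6379/0",
    backend="redis://localhost:6379/0",
)

@celery_app.task(name="training.run_experiment")  # hypothetical task name
def run_experiment(experiment_id: int, env_name: str, algo: str) -> dict:
    # ... run the RL training loop and persist metrics here ...
    return {"experiment_id": experiment_id, "status": "completed"}
```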
-H "Content-Type: application/json" \
-d '{"env_name": "CartPole-v1", "version": "v1"}'Output
{"id":1,"env_name":"CartPole-v1","version":"v1","registered_at":"2025-10-29T06:00:00"}curl -X POST http://localhost:8000/api/experiments \
-H "Content-Type: application/json" \
-d '{"name": "CartPole-v1", "agent": "DQN", "episodes": 500}'Output
{"id":1,"name":"CartPole-v1","agent":"DQN","episodes":500,"status":"created","created_at":"2025-10-29T06:01:00"}curl -X POST http://localhost:5000/api/v1/start_training \
-H "Content-Type: application/json" \
-d '{"experiment_id": 1, "env_name": "CartPole-v1", "algo": "DQN"}'Output
{
"queued_at": "2025-10-30T06:07:06.472880",
"status": "queued",
"task_id": "9821142c-3450-4bba-84af-7df037705bb6"
}
```
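The returned task_id can be used to poll the task from Python with Celery's AsyncResult. A sketch, reusing the celery_app module referenced by the worker command:

```python
# Sketch: poll a queued training task by its task_id, reusing the
# celery_app referenced in the worker command above.
from backend.fastapi_app.services.orchestrator import celery_app

result = celery_app.AsyncResult("9821142c-3450-4bba-84af-7df037705bb6")
print(result.state)       # PENDING, STARTED, SUCCESS, FAILURE, ...
if result.ready():
    print(result.result)  # the task's return value, once available
```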
Fetch a summary of recent experiments:

```bash
curl http://127.0.0.1:5000/api/v1/analytics/recent
```

Output:

```json
{
"summary": {
"total_experiments": 3,
"avg_reward": 182.4,
"avg_duration": 47.2
},
"experiments": [
{"id":1,"agent":"DQN","avg_reward":190.2},
{"id":2,"agent":"PPO","avg_reward":176.0},
{"id":3,"agent":"A2C","avg_reward":181.0}
]
}
```

Fetch analytics for a single experiment:

```bash
curl http://127.0.0.1:5000/api/v1/analytics/experiment/1
```

Output:

```json
{
"experiment_id":1,
"agent":"DQN",
"avg_reward":190.2,
"episodes":500,
"status":"completed"
Upload a sample model for benchmarking:

```bash
# upload sample model ReSimHub/docs/dqn_model.pkl
# replace with actual intended model path
curl -F "file=@docs/dqn_model.pkl" \
http://127.0.0.1:8000/benchmark/upload_model
```

Output:

```json
{
"model_id": "mdl_afdbb795",
"status": "uploaded",
"uploaded_at": "2025-10-30T14:16:26.241943"
}
```

Run a benchmark against a registered environment:

```bash
curl -X POST http://127.0.0.1:8000/benchmark/run \
-F "model_id=mdl_afdbb795" \
-F "env_name=CartPole-v1" \
-F "episodes=50"Output
{
"model_id": "mdl_afdbb795",
"env_name": "CartPole-v1",
"mean_reward": 203.68,
"std_reward": 34.65,
"median_reward": 204.18,
"latency_ms": 26.02,
"total_episodes": 50,
"status": "completed",
"evaluated_at": "2025-10-30T14:23:19.748328"
}
```
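The same upload-and-benchmark flow can be driven from Python. A sketch using the requests library, mirroring curl's -F form fields:

```python
# Sketch of the upload-then-benchmark flow above, using requests.
# Fields are sent as multipart form data, mirroring curl's -F flags.
import requests

BASE = "http://127.0.0.1:8000"

with open("docs/dqn_model.pkl", "rb") as f:
    uploaded = requests.post(f"{BASE}/benchmark/upload_model", files={"file": f})
model_id = uploaded.json()["model_id"]

run = requests.post(
    f"{BASE}/benchmark/run",
    files={  # (None, value) sends a plain form field, like curl -F
        "model_id": (None, model_id),
        "env_name": (None, "CartPole-v1"),
        "episodes": (None, "50"),
    },
)
print(run.json()["mean_reward"])
```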
List recent benchmark results:

```bash
curl http://127.0.0.1:8000/benchmark/recent
```

Output:

```json
{
"count": 2,
"results": [
{
"model_id": "mdl_afdbb795",
"env_name": "CartPole-v1",
"mean_reward": 203.68,
"std_reward": 34.65,
"median_reward": 204.18,
"latency_ms": 26.02,
"total_episodes": 50,
"status": "completed",
"evaluated_at": "2025-10-30T14:23:19.748328"
},
{
"model_id": "mdl_afdbb795",
"env_name": "CartPole-v1",
"mean_reward": 198.68,
"std_reward": 37.13,
"median_reward": 188.05,
"latency_ms": 23.86,
"total_episodes": 50,
"status": "completed",
"evaluated_at": "2025-10-30T14:22:28.345363"
}
]
}curl "http://127.0.0.1:8000/benchmark/compare?model_ids=mdl_afdbb795,mdl_12345678&env=CartPole-v1"Output
{
"count": 1,
"comparison": [
{
"model_id": "mdl_afdbb795",
"env_name": "CartPole-v1",
"mean_reward": 204.06,
"std_reward": 31.64,
"median_reward": 197.33,
"latency_ms": "np.float64(25.53)",
"total_episodes": "50",
"status": "completed",
"evaluated_at": "2025-10-30T14:32:39.036548"
}
]
}
```

The ReSimHub framework emerges from an analysis of the reinforcement learning (RL) infrastructure landscape, as documented in docs/rl_landscape.md.
This contextual study explores:
- Current gaps between RL research tools and production-grade simulation systems
- The fragmentation of orchestration and analytics workflows in existing frameworks
- The need for unified experiment orchestration, analytics pipelines, and scalable evaluation layers
By addressing these gaps, ReSimHub provides a bridge between academic experimentation and scalable applied RL systems. This background analysis establishes the motivation for ReSimHub’s architecture, ensuring that design choices align with real-world reproducibility and scalability challenges.

📖 For full details, see the RL Landscape Analysis (docs/rl_landscape.md).
ReSimHub includes a comprehensive testing suite for API validation, benchmark execution, and quality assurance.
```bash
# Install dependencies
pip install -r requirements.txt
# Prepare storage
mkdir -p storage/models
# Run tests
python -m pytest tests/test_benchmark_api.py -v --disable-warnings
```

Expected output:

```
tests/test_benchmark_api.py::test_upload_model PASSED [ 20%]
tests/test_benchmark_api.py::test_run_benchmark PASSED [ 40%]
tests/test_benchmark_api.py::test_list_recent_results PASSED [ 60%]
tests/test_benchmark_api.py::test_compare_models PASSED [ 80%]
tests/test_benchmark_api.py::test_invalid_compare_model_id PASSED [100%]
================================================= 5 passed in 1.33s ==================================================
```
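For reference, a test in this suite might look like the following sketch, using FastAPI's TestClient; the endpoint and response fields mirror the examples above, while the rest is illustrative:

```python
# Illustrative test in the style of tests/test_benchmark_api.py;
# the asserted fields mirror the example responses above.
from fastapi.testclient import TestClient

from backend.fastapi_app.main import app

client = TestClient(app)

def test_upload_model(tmp_path):
    model_file = tmp_path / "dqn_model.pkl"
    model_file.write_bytes(b"dummy-model-bytes")
    with model_file.open("rb") as f:
        resp = client.post("/benchmark/upload_model", files={"file": f})
    assert resp.status_code == 200
    assert resp.json()["status"] == "uploaded"
```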
For detailed setup instructions, configuration options, CI/CD integration, and troubleshooting guides, see the project documentation.

Planned directions include:

- Multi-agent orchestration and scheduling
- REST → gRPC bridge
- ReSimHub Dashboard (React + WebSocket visualisation)
- Plugin system for custom RL environments
- Automated benchmark publishing (OpenAI Gym, PettingZoo)
This project is licensed under the BSD 3-Clause License — see the LICENSE file for details.
BSD 3-Clause License
Copyright (c) 2025, Mosudi Isiaka
All rights reserved.
Mosudi Isiaka
📧 mosudi.isiaka@gmail.com
🌐 https://mioemi.com
💻 https://github.com/imosudi
Contributions are welcome!
Please open an issue or pull request to suggest new features, improvements, or bug fixes.
If you use ReSimHub in your research, please cite as:
```bibtex
@software{ReSimHub2025,
author = {Isiaka, Mosudi},
title = {ReSimHub: Scalable Research Backend for Reinforcement Learning Experimentation},
year = {2025},
url = {https://github.com/imosudi/ReSimHub},
license = {BSD-3-Clause}
}
```

“ReSimHub bridges simulation, orchestration, and reproducible reinforcement learning - for scalable research you can trust.”