A comparative study of Q-Learning, SARSA, and Dyna-Q with planning and robustness analysis.
This project implements and analyzes core tabular reinforcement learning algorithms inside a custom stochastic GridWorld environment.
- A full GridWorld environment with walls, pits, wind (stochasticity), rewards, indexing logic, and deterministic seeding.
- Algorithms Implemented (see the update-rule sketch below):
  - Q-Learning (off-policy TD control)
  - SARSA(0) (on-policy TD control)
  - Dyna-Q (model-based RL + planning updates)
- Experiments on:
  - Planning sweep (varying Dyna-Q planning steps $K$)
  - Robustness across environment layouts & random seeds
  - Final policy evaluation
- A complete unit test suite for all algorithms, GridWorld, and utils.
- Well-structured Jupyter notebooks for analysis.
- Reusable experiment utilities (seed experiments, plotting, train-with-logs functions).
Goal: Understand sample efficiency, stability, and policy robustness across model-free and model-based RL methods in controlled environments.
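For orientation, the core update rules behind the three methods fit in a few lines of NumPy. This is a minimal sketch only, not the project's API: the real implementations (including the logging variants) live in `src/rl_capstone/rl_algorithms.py`, and all names, signatures, and hyperparameter values below are assumptions.

```python
import numpy as np

# Illustrative tabular updates only -- names, signatures, and values are assumptions,
# not the repo's API (see src/rl_capstone/rl_algorithms.py for the real code).
n_actions, alpha, gamma = 4, 0.1, 0.99
rng = np.random.default_rng(0)
Q = np.zeros((25, n_actions))  # one row per grid cell, one column per action

def q_learning_update(Q, s, a, r, s_next):
    # Off-policy TD control: bootstrap from the greedy action in s_next.
    Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])

def sarsa_update(Q, s, a, r, s_next, a_next):
    # On-policy TD control: bootstrap from the action actually chosen in s_next.
    Q[s, a] += alpha * (r + gamma * Q[s_next, a_next] - Q[s, a])

def dyna_q_planning(Q, model, K):
    # Dyna-Q planning: after each real step, replay K simulated transitions
    # from a learned tabular model {(s, a): (r, s_next)} with the Q-Learning update.
    seen = list(model)
    for _ in range(K):
        s, a = seen[rng.integers(len(seen))]
        r, s_next = model[(s, a)]
        q_learning_update(Q, s, a, r, s_next)
```

Dyna-Q reuses the Q-Learning update on simulated transitions drawn from its learned model, which is why increasing the planning budget $K$ trades extra computation per step for improved sample efficiency.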
reinforcement_learning/
│
├── src/rl_capstone/
│ ├── gridworld.py # Environment implementation
│ ├── rl_algorithms.py # Q-Learning, SARSA, Dyna-Q + logging variants
│ ├── utils.py # Action selection, schedules, evaluation, plotting
│ └── __init__.py
│
├── notebooks/
│ ├── 00_RL.ipynb
│ ├── 01_q_learning.ipynb
│ ├── 02_sarsa.ipynb
│ ├── 03_dyna_q.ipynb
│ ├── 04_comparison_models.ipynb
│ ├── 05_k_sweep.ipynb
│ ├── 06_robustness.ipynb
│ └── 07_results.ipynb
│
├── tests/
│ ├── test_gridworld.py
│ ├── test_rl_algorithms.py
│ └── test_utils.py
│
├── data/
│ ├── q_tables/ # Saved NumPy Q-tables
│ └── robustness/ # npz files for seed stability experiments
│
├── reports/
│ ├── figs/ # Result figures
│ ├── Report.tex # LaTeX source
│ └── Report.pdf # Compiled PDF report
│
├── requirements.txt
└── README.md
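The artifacts under `data/` are plain NumPy files, so they can be inspected directly once the experiments have been run. A minimal sketch follows; the file names are hypothetical, so list the directories for the actual names written by the notebooks.

```python
import numpy as np

# Hypothetical file names -- check data/q_tables/ and data/robustness/ for the real ones.
Q = np.load("data/q_tables/q_learning.npy")       # saved (n_states, n_actions) Q-table
print("Q-table shape:", Q.shape)

runs = np.load("data/robustness/seed_sweep.npz")  # named arrays from the seed experiments
print("Stored arrays:", runs.files)
```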
This project uses a local Python virtual environment (.venv) and Jupyter notebooks for analysis.
The virtual environment is not committed to GitHub, so you must create it after cloning the repository.
git clone https://github.com/fcampoverdeg/reinforcement_learning.git
cd reinforcement_learning
python3 -m venv .venv
source .venv/bin/activate # on macOS/Linux
# .venv\Scripts\activate # On Windows PowerShell

You should now see (.venv) at the beginning of the shell prompt.
pip install -r requirements.txt
# In case 'requirements.txt' is not available
pip install numpy scipy matplotlib jupyterlab ipykernel pandas tqdm \
black ruff pytest pytest-cov mypy gymnasium pygame
# Install the local package in editable mode
pip install -e .

Running the test suite ensures GridWorld and all algorithms work correctly:

pytest -q

This step makes your virtual environment visible inside JupyterLab:

python -m ipykernel install --user --name gridrl --display-name "Python (gridrl)"
jupyter lab
- How do Dyna-Q planning steps $K$ affect sample efficiency?
- How sensitive are Q-Learning, SARSA, and Dyna-Q to ε-greedy schedules? (A sketch of a typical schedule follows this list.)
- How robust are policies across seeds, layout changes, and wind noise?
- Does model-based planning consistently improve stability and convergence?
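For reference, a linearly decaying ε-greedy schedule is the kind of schedule such sensitivity experiments typically compare. The sketch below is illustrative only; the schedules and action-selection helpers actually used here live in `src/rl_capstone/utils.py`, and the function names and values are assumptions.

```python
import numpy as np

def linear_epsilon(episode, eps_start=1.0, eps_end=0.05, decay_episodes=500):
    """Linearly anneal exploration from eps_start to eps_end (illustrative values)."""
    frac = min(episode / decay_episodes, 1.0)
    return eps_start + frac * (eps_end - eps_start)

def epsilon_greedy(q_row, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise the greedy one."""
    if rng.random() < epsilon:
        return int(rng.integers(len(q_row)))
    return int(np.argmax(q_row))

rng = np.random.default_rng(0)
print([linear_epsilon(e) for e in (0, 100, 250, 500, 1000)])  # decays, then stays at eps_end
```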
All findings are documented in the results notebook (notebooks/07_results.ipynb).
Felipe Campoverde
Virginia Tech - RL Capstone Research Project