MarioRL 🎮

Train a PPO agent to play Super Mario Bros using Stable-Baselines3 on stable-retro. This project targets Linux (Ubuntu 22.04 LTS on WSL) and uses uv for a fast, reproducible Python environment.

A single training entrypoint (main.py) is provided along with a separate runner (play.py) to play the full game with a trained policy.

Overview • Requirements • Important Links • Quickstart • ROM Import • Train • Play • Monitoring • Tips • Legal

Overview 🧭

Stable-Baselines3 (PPO) with stable-retro
Linux-first workflow (Ubuntu 22.04 LTS on WSL)
ROM import instructions
CNN Policy applied
TensorBoard monitoring to analyze the performance
Custom game window using OpenCV [Optional]
Runner script to play the full game using a trained policy

Important Links 🔗

The compiled understandings can be found in this PDF
All resources are available on TLDRAW board

Requirements 📋

Ubuntu 22.04 LTS (native or WSL)
Python 3.10
uv, FFmpeg, and OpenGL
A legally obtained Super Mario Bros NES ROM

Note

Ubuntu 22.04 LTS includes Python 3.10 by default.

Quickstart (Windows • WSL • VS Code) 🚀

Install Ubuntu 22.04 (WSL):

wsl --install --distribution Ubuntu-22.04

Open Ubuntu from the Start menu and continue in that terminal.

Clone and open:

git clone https://github.com/amugoodbad229/MarioRL.git
cd MarioRL
code .

System dependencies:

sudo apt-get update -y && sudo apt-get upgrade -y
sudo apt-get install -y git curl zlib1g-dev libssl-dev ffmpeg python3-opengl

Install uv:

curl -LsSf https://astral.sh/uv/install.sh | sh
uv --version

Sync the environment (creates .venv and installs dependencies):

uv sync

Activate the environment:

source .venv/bin/activate

ROM Import 📥

Place your ROM:

mkdir -p ROM
# Copy your legally obtained Super Mario Bros.nes file into ./ROM

Import into Retro’s database:

python3 -m retro.import ./ROM

Ensure the game name (e.g., SuperMarioBros-Nes) matches what your scripts expect.

Quick Test: Random Agent 🧪

Run a quick environment sanity check:

python3 randomAgent.py

Important

Terminate the process with Ctrl+C.

OPTIONAL: check CUDA availability for PyTorch

python3 -c "import torch; print(torch.cuda.is_available())"

Warning

Set device = "cuda", as we are dealing with images

Train 🏋️

Run the training entrypoint:

python3 main.py

Play the Full Game ▶️

OPTIONAL: You can select the path file to your desired folder. For now, I have set it up for you.

After training, run the separate runner to play:

python3 play.py

Use the same observation wrappers in play.py that you used in training.

Monitoring 📈

Launch TensorBoard:

python -m tensorboard.main --logdir tensorboard

Open the printed URL in your browser to view training curves and metrics.

Tips 💡

Ensure the environment is active:

source .venv/bin/activate

Re-run sync if dependencies change:

uv sync

Re-import ROMs if not detected:

python3 -m retro.import ./ROM

Important

If Imported 0 games shows up, it could mean you downloaded a corrupted copy.

Clean stop vs suspend:

# Clean stop
# Press Ctrl+C

# If you pressed Ctrl+Z by mistake:
# enter fg on the terminal to bring it to foreground, then press Ctrl+C

Acknowledgements 🙏

Support 🤝

If this repository helps you learn or prototype faster:

Star the repo
Share feedback and ideas
Open issues for bugs or improvements

Legal ⚖️

ROMs are not included and must be obtained legally. Do not commit or distribute ROMs in this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
main.py		main.py
play.py		play.py
pyproject.toml		pyproject.toml
randomAgent.py		randomAgent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MarioRL 🎮

Overview 🧭

Important Links 🔗

Requirements 📋

Quickstart (Windows • WSL • VS Code) 🚀

ROM Import 📥

Quick Test: Random Agent 🧪

Train 🏋️

Play the Full Game ▶️

Monitoring 📈

Tips 💡

Acknowledgements 🙏

Support 🤝

Legal ⚖️

About

Uh oh!

Languages

License

amugoodbad229/MarioRL

Folders and files

Latest commit

History

Repository files navigation

MarioRL 🎮

Overview 🧭

Important Links 🔗

Requirements 📋

Quickstart (Windows • WSL • VS Code) 🚀

ROM Import 📥

Quick Test: Random Agent 🧪

Train 🏋️

Play the Full Game ▶️

Monitoring 📈

Tips 💡

Acknowledgements 🙏

Support 🤝

Legal ⚖️

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages