Gym Box2D Project

Introduction

This project contains the implementation of reinforcement learning algorithms to solve the Lunar Lander and Bipedal Walker environments using the DQN and DDPG algorithms respectively.

Environments

DQN Lunar Lander	TD3 Bipedal Walker easy	TD3 Bipedal Walker Hardcore (wip)

Lunar Lander

The Lunar Lander environment is a classic control problem where the goal is to navigate a spacecraft to land on a designated landing pad while avoiding obstacles and minimizing fuel consumption. The observation space is continuous and consists of eight variables representing the position, velocity, and angle of the spacecraft, and the landing pad location. The action space is discrete with four possible actions - do nothing, fire left orientation engine, fire main engine, fire right orientation engine.

Bipedal Walker

The Bipedal Walker environment is a control problem where the goal is to control a 2D bipedal robot to walk and reach the end of a track without falling. The observation space is continuous and consists of 24 variables representing the position, velocity, and angular velocity of the robot and the ground. The action space is continuous with four values representing the torque applied to the two hip and knee joints.

Dependencies

This project has the following dependencies:

numpy
jupyterlab
matplotlib
six
pyglet
imageio
imageio[ffmpeg]
scipy
PyOpenGL
gym
gym[box2d]
Pillow
opencv-python
ipython
nodejs-bin
jupyter_contrib_nbextensions
tensorflow
ipywidgets
pandas
tensorflow_probability

These dependencies can be installed using pipenv with the following command:

Installation

These packages can be installed by running the following command:

pipenv shell
pipenv install --requirements "requirements.txt"
pipenv install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

to install ipython

jupyter nbextensions_configurator enable --user
jupyter nbextension enable --py widgetsnbextension

Project Structure

.
├── img
│   ├── bipedal_walker_easy
│   │   └──bipedal_walker_easy.gif
│   └── lunar_lander
|       └──lunar_lander.gif
├── Pipfile
├── Pipfile.lock
├── Lunar_Lander.ipynb
├── Bipedal_Walker.ipynb
├── README.md
├── LICENSE.md
└── requirements.txt

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgments

OpenAI Gym

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
img		img
saves		saves
.gitignore		.gitignore
Bipedal_Walker_easy.ipynb		Bipedal_Walker_easy.ipynb
Bipedal_Walker_hard.ipynb		Bipedal_Walker_hard.ipynb
LICENCE.md		LICENCE.md
Lunar_Lander.ipynb		Lunar_Lander.ipynb
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
Readme.md		Readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gym Box2D Project

Introduction

Environments

Lunar Lander

Bipedal Walker

Dependencies

Installation

Project Structure

License

Acknowledgments

About

Releases

Packages

Contributors 2

Languages

License

thibaud-perrin/box2d

Folders and files

Latest commit

History

Repository files navigation

Gym Box2D Project

Introduction

Environments

Lunar Lander

Bipedal Walker

Dependencies

Installation

Project Structure

License

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages