Using Decision Transformer as a Booster

This repository contains the code and data for our research project on the Decision Transformer (DT) as a booster for other learners in a reinforcement learning setting. The project investigates if the offline, batch learning capabilities of DT can be applied on other agents to generalize to more optimal actions.

Introduction

Reinforcement Learning (RL) is a powerful paradigm for developing agents that can learn to make decisions in complex environments. However, RL algorithms can be slow and sample inefficient when learning through online interaction with the environment. The Decision Transformer (DT) offers a promising alternative by learning offline from a dataset of trajectories. DT has shown impressive results in several domains, including Atari games, but it requires access to an expert dataset of trajectories.

In this research project, we investigate whether the DT can be used as a booster for other learners in a reinforcement learning setting. Specifically, we aim to collect and use two datasets of trajectories, one containing random trajectories and the other containing expert trajectories generated by a Q-learning agent. We will then train the DT on these datasets and compare its performance with that of a Q-learning agent and a more sophisticated expert learner, the Deep Q-Network (DQN).

Research Questions

Can the DT effectively learn from the expert dataset of trajectories to outperform other learners? How do the hyperparameters of the DT affect its performance, and can we identify optimal hyperparameters? Can the DT be used as a booster for other learners in a reinforcement learning setting, and does it provide a performance gain over the baseline learners?

Getting Started

Clone our repository with git clone git@github.com:haraldger/DRL-DecisionTransformer.git (preferred) or with git clone https://github.com/haraldger/DRL-DecisionTransformer.git to a desired directory. Navigate to the directory: cd DRL-DecisionTransformer.
(Recommended) Create a virtual environment for package management and separation. In the project directory

python -m venv ./venv
source venv/bin/activate

To activate the venv on Windows:

venv\scripts\activate.bat

Make sure pip is updated: pip install --upgrade pip
Install dependencies from requirements file: pip install -r requirements.txt
Launch the random agent (basic) with: python main.py

For development:

To add dependencies, follow these steps:

Make sure you are working on a virtual environment, with only the dependencies of the project installed.
To list installed packages: pip freeze
When the new required package is installed through pip, run: pip freeze > requirements.txt.

Name		Name	Last commit message	Last commit date
Latest commit History 394 Commits
Agents		Agents
networks		networks
results		results
tests		tests
utils		utils
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
collect_random_agent_trajectories.py		collect_random_agent_trajectories.py
dt_main.py		dt_main.py
main.py		main.py
requirements.txt		requirements.txt
run_tests.py		run_tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using Decision Transformer as a Booster

Introduction

Research Questions

Getting Started

For development:

About

Releases

Packages

Contributors 2

Languages

haraldger/DRL-DecisionTransformer

Folders and files

Latest commit

History

Repository files navigation

Using Decision Transformer as a Booster

Introduction

Research Questions

Getting Started

For development:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages