😄 This work has been accepted at the 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
🚀 This work implements a novel Reinforcement Learning (RL) approach for autonomous driving with monotonic evolution capability. The algorithm ensures continuous policy improvement with a high-confidence guarantee.
- Monotonic performance enhancement via high-confidence policy improvement
- Safe and robust online training
- Integrated decision-making and motion planning
- `main.py`: Main training script for the reinforcement learning algorithm
- `monotonic_evolution_RL.py`: Implementation of the PPO (Proximal Policy Optimization) algorithm
- `normalization.py`: State and reward normalization utilities
- `replaybuffer.py`: Experience replay buffer for storing transitions
- `VissimEnvironment.py`: Interface between VISSIM traffic simulation and the RL algorithm
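As one illustration of the utilities above, state normalization for PPO is commonly implemented with a Welford-style running mean/variance. The sketch below shows that pattern; it is a hedged example under that assumption, not the actual code in `normalization.py` (class and method names are illustrative):

```python
import numpy as np

class RunningNormalizer:
    """Illustrative running state normalizer (Welford's online algorithm)."""

    def __init__(self, shape):
        self.n = 0                       # number of samples seen
        self.mean = np.zeros(shape)      # running mean per dimension
        self.m2 = np.zeros(shape)        # running sum of squared deviations

    def update(self, x):
        """Incorporate one observation into the running statistics."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    def normalize(self, x):
        """Return x shifted and scaled by the running mean and std."""
        std = np.sqrt(self.m2 / max(self.n, 1)) + 1e-8  # epsilon avoids /0
        return (x - self.mean) / std
```

Normalizing states this way keeps network inputs in a consistent range as the traffic-state distribution drifts during online training.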
- Windows operating system (required for VISSIM integration)
- VISSIM traffic simulation software (version 22)
- Python 3.8
- Conda package manager
1. Install VISSIM 22 on your Windows system.
2. Create and activate the conda environment:
   ```
   conda env create -f environment.yml
   conda activate monotonic_evolution_rl
   ```
3. Verify VISSIM is properly installed and accessible via the COM interface.
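A quick way to check the COM interface is to try instantiating the VISSIM COM server from Python. This is a hedged sketch: it assumes `pywin32` is installed and that VISSIM registers the ProgID `"Vissim.Vissim"` (your installation may expose a versioned ProgID instead); it simply reports availability rather than failing hard:

```python
def vissim_com_available():
    """Return True if the VISSIM COM server can be instantiated on this machine."""
    try:
        import win32com.client  # pywin32; Windows-only
        # "Vissim.Vissim" is the assumed ProgID; check your VISSIM install
        vissim = win32com.client.Dispatch("Vissim.Vissim")
        vissim = None  # release the COM object
        return True
    except Exception:
        # ImportError off Windows, or COM error if VISSIM is not registered
        return False

if __name__ == "__main__":
    print("VISSIM COM accessible:", vissim_com_available())
```

If this prints `False` on a machine with VISSIM installed, re-registering the COM server from an administrator shell usually resolves it.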
Run the main training script:
python main.py
You can modify hyperparameters using command line arguments, for example:
python main.py --max_train_steps 500000 --gamma 0.98
The default hyperparameters can be found in main.py. You can customize:
- `--max_train_steps`: Maximum number of training steps
- `--gamma`: Discount factor for future rewards
- `--hidden_width`: Width of hidden layers in the networks
- And many other PPO-specific parameters
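The flags above can be defined with Python's standard `argparse` module. The sketch below mirrors the documented flag names; the default values and parser structure are illustrative, not taken from `main.py`:

```python
import argparse

def build_parser():
    """Illustrative argument parser; defaults here are placeholders."""
    parser = argparse.ArgumentParser(
        description="Monotonic-evolution RL training (sketch)")
    parser.add_argument("--max_train_steps", type=int, default=500_000,
                        help="Maximum number of training steps")
    parser.add_argument("--gamma", type=float, default=0.99,
                        help="Discount factor for future rewards")
    parser.add_argument("--hidden_width", type=int, default=64,
                        help="Width of hidden layers in the networks")
    return parser

# Parsing an explicit argv list, equivalent to:
#   python main.py --max_train_steps 500000 --gamma 0.98
args = build_parser().parse_args(["--max_train_steps", "500000",
                                  "--gamma", "0.98"])
```

Unspecified flags fall back to their defaults, so only the hyperparameters being tuned need to appear on the command line.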