Dynamic resource allocation in networked systems is necessary to achieve end-to-end management objectives. Previous research has demonstrated that reinforcement learning (RL) is a promising approach to this problem, making it possible to obtain near-optimal resource allocation policies for non-trivial system configurations. Despite these advances, a significant drawback of current approaches is that they require expensive and slow retraining whenever the target system changes. We address this drawback and introduce an efficient method for adapting a given base policy to dynamic system changes. In our approach, the base policy is adapted through rollout and online play, which transforms it into a rollout policy.
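The core idea of transforming a base policy into a rollout policy can be sketched as a one-step lookahead: for each candidate action, simulate that action and then follow the base policy for a fixed horizon, and select the action with the best estimated return. The toy MDP, reward function, and policies below are illustrative assumptions, not the networked-system model or trained policies used in this repository:

```python
# Minimal sketch of one-step rollout over a base policy.
# The toy MDP (5 states on a line, reward equal to the next state's index)
# is an illustrative assumption, not this repository's system model.

N_STATES = 5
ACTIONS = (-1, +1)  # move left / move right

def step(state, action):
    """Deterministic toy transition: reward grows with the state index."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    return next_state, float(next_state)

def base_policy(state):
    """A deliberately poor base policy: always move left."""
    return -1

def rollout_value(state, horizon):
    """Return collected by following the base policy for `horizon` steps."""
    total = 0.0
    for _ in range(horizon):
        state, reward = step(state, base_policy(state))
        total += reward
    return total

def rollout_policy(state, horizon=10):
    """One-step lookahead: try each action, then follow the base policy."""
    def q(action):
        next_state, reward = step(state, action)
        return reward + rollout_value(next_state, horizon)
    return max(ACTIONS, key=q)

print(base_policy(2), rollout_policy(2))  # base moves left (-1); rollout moves right (+1)
```

Even with a poor base policy, the one-step lookahead improves the decision at the current state; this is the policy-improvement property that rollout relies on.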
The following figure shows our approach for policy adaptation in networked systems. During each control cycle, the system model
- `gym` and `gymnasium`: for creating the RL environments
- `joblib`: for loading/exporting random forest regressor models
- `sb3-contrib`: for reinforcement learning agents (Maskable PPO)
- `scikit-learn`: for random forest regression
- `scipy`: for random forest regression
- `stable-baselines3`: for reinforcement learning agents (PPO)
- `torch` and `torchvision`: for neural network training
- `matplotlib`: for plotting
- `pandas`: for data wrangling
- `requests`: for making HTTP requests
- Python 3.8+
- `flake8` (for linting)
- `tox` (for automated testing)
# install from pip
pip install online_policy_adaptation_using_rollout==<version>

# local install from source
pip install -e online_policy_adaptation_using_rollout

# or (equivalently):
make install

# force upgrade dependencies
pip install -e online_policy_adaptation_using_rollout --upgrade

# clone and install from source
git clone https://github.com/foroughsh/online_policy_adaptation_using_rollout
cd online_policy_adaptation_using_rollout
pip3 install -e .

# install development dependencies
pip install -r requirements_dev.txt
cd examples; python run_scenario_1.py
cd examples; python run_scenario_2.py
cd examples; python run_scenario_3.py
Creative Commons (C) 2023-2024, Forough Shahabsamani and Kim Hammar
- Forough Shahabsamani foro@kth.se
- Kim Hammar kimham@kth.se