This is a blog post based on the paper Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble, which proposes the offline RL algorithm EDAC (Ensemble Diversified Actor Critic).
There are several implementations:
- install Quarto
- clone this repo
pip install -r blog/requirements.txt
quarto preview blog/blog.ipynb
The main content is in blog/blog.ipynb
, the references are in blog/references.bib
and the images are in blog/figures/
.