The pytorch implemetation of trpo
-
Updated
Mar 14, 2022 - Python
The pytorch implemetation of trpo
My solutions to the labs from this bootcamp:
Scheduling TRPO's KL Divergence Constraint
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning''
A collection of Reinforcement Learning implementations with PyTorch
works about solving nonlinear dynamic systems
Course projects of CS395T Numerical Optimization, UT Austin
Python implementation of some numerical (optimization) methods
A trading bitcoin agent was created with deep reinforcement learning implementations.
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Trust Region Policy Optimization (TRPO) in pure TensorFlow
PyTorch implementation of Trust Region Policy Optimization
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Add a description, image, and links to the trust-region-policy-optimization topic page so that developers can more easily learn about it.
To associate your repository with the trust-region-policy-optimization topic, visit your repo's landing page and select "manage topics."