Skip to content

reshalfahsi/inverted-double-pendulum-sac

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Balancing Act: Mastering the Inverted Double Pendulum with Soft Actor-Critic

colab

The inverted double pendulum is a hallmark of control theory, renowned for its instability and nonlinear dynamics. This project explores the challenge of stabilizing this system using the Soft Actor-Critic (SAC) algorithm, a state-of-the-art reinforcement learning method, within the MuJoCo physics engine. Through empirical experimentation, we harness SAC to develop a robust control strategy that balances the double pendulum upright with minimal torque, effectively navigating its complex behavior. Simulation results highlight SAC’s capability to adaptively learn policies for this demanding task, offering practical insights into its application for continuous control problems. This project demonstrates the power of SAC in addressing intricate dynamical systems and contributes to the growing field of reinforcement learning in control theory.

Experiment

Play around with this notebook to gain knowledge of balancing the inverted double pendulum with SAC.

Result

Reward Curve

reward_curve
The green line denotes rewards and the blue line indicates the best moving average. The best-moving average is gathered by applying an arithmetic mean to the reward that is better than the current best-moving average. This is used to decide whether the episode is worth keeping or not.

Qualitative Result

These are the evolution of the control of the inverted double pendulum. The control is progressively better.

Episode 0 Episode 400 Final Episode

Credit

About

Balancing Act: Mastering the Inverted Double Pendulum with Soft Actor-Critic

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published