The inverted double pendulum is a classic benchmark in control theory, known for its instability and nonlinear dynamics. This project tackles the challenge of stabilizing the system with the Soft Actor-Critic (SAC) algorithm, a state-of-the-art reinforcement learning method, simulated in the MuJoCo physics engine. Through empirical experimentation, we use SAC to learn a robust control strategy that balances the double pendulum upright with minimal torque despite its complex dynamics. Simulation results show that SAC adaptively learns effective policies for this demanding task and offer practical insight into its use for continuous control problems. The project demonstrates the strength of SAC on intricate dynamical systems and contributes to the growing body of work on reinforcement learning for control.
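As a minimal sketch of the task setup, the snippet below creates the MuJoCo inverted double pendulum environment in Gymnasium and trains SAC on it. The actual training code in this project lives in the notebook (built on PyTorch Lightning); the Stable-Baselines3 `SAC` class, the environment version, and the timestep budget here are assumptions used only for illustration.

```python
import gymnasium as gym
from stable_baselines3 import SAC  # stand-in SAC implementation, not the notebook's own

# InvertedDoublePendulum: keep a two-link pendulum upright on a cart
# by applying a continuous force to the cart (environment version is an assumption).
env = gym.make("InvertedDoublePendulum-v5")

# Entropy-regularized, off-policy actor-critic; "MlpPolicy" uses small
# fully connected networks for the actor and the Q critics.
model = SAC("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=200_000)  # training budget chosen for illustration

# Evaluation rollout with the learned policy acting deterministically.
obs, _ = env.reset()
done = False
while not done:
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, _ = env.step(action)
    done = terminated or truncated
```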
Experiment with this notebook to learn how the inverted double pendulum is balanced with SAC.
The green line denotes episode rewards and the blue line indicates the best moving average. The best moving average is updated by taking the arithmetic mean of recent episode rewards and adopting it whenever it exceeds the current best; this criterion is used to decide whether an episode is worth keeping.
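As a rough sketch of this bookkeeping (the variable names and the window size are assumptions, not taken from the notebook):

```python
from collections import deque

window = deque(maxlen=100)          # recent episode rewards; window size is an assumption
best_moving_average = float("-inf")

def record_episode(episode_reward: float) -> bool:
    """Track the reward, update the best moving average, and report
    whether this episode improved it (i.e. whether it is worth keeping)."""
    global best_moving_average
    window.append(episode_reward)
    moving_average = sum(window) / len(window)   # arithmetic mean of recent rewards
    if moving_average > best_moving_average:
        best_moving_average = moving_average     # new best -> keep this episode/checkpoint
        return True
    return False
```

A checkpoint of the agent would then be saved whenever `record_episode` returns `True`.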
The images below show the evolution of the control of the inverted double pendulum; the control becomes progressively better over training.
*(Rollout images: the controller at successive stages of training.)*
- Neuronlike adaptive elements that can solve difficult learning control problems
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
- Soft Actor-Critic
- Workaround for the FatalError: gladLoadGL error when trying to run "HalfCheetah-v5" in Colaboratory [gymnasium / mujoco]
- Inverted Double Pendulum
- PyTorch Lightning