Skip to content

TheUnsolvedDev/ReinforcementLearning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Reinforcement-Learning-Study

Repository containing basic algorithm applied in python.

Reinforcement Learning
β”œβ”€β”€ DeepRL
β”‚Β Β  β”œβ”€β”€ Actor_Critic
β”‚Β Β  β”œβ”€β”€ Actor_Critic_Uni_model
β”‚Β Β  β”œβ”€β”€ DQN
β”‚Β Β  β”‚Β Β  └── Vanilla_DQN
β”‚Β Β  └── Reinforce
β”‚Β Β      β”œβ”€β”€ off_policy
β”‚Β Β      └── on_policy
β”œβ”€β”€ DynamicProgramming
β”‚Β Β  β”œβ”€β”€ multi-arm-bandit
β”‚Β Β  β”œβ”€β”€ policy_iteration
β”‚Β Β  └── value_iteration
β”œβ”€β”€ MonteCarlo
β”‚Β Β  β”œβ”€β”€ monte_carlo_continous_env
β”‚Β Β  β”œβ”€β”€ monte_carlo_epsilon_greedy_exploration
β”‚Β Β  β”œβ”€β”€ monte_carlo_every_visit
β”‚Β Β  β”œβ”€β”€ monte_carlo_exploring
β”‚Β Β  β”œβ”€β”€ monte_carlo_first_visit
β”‚Β Β  β”œβ”€β”€ monte_carlo_off_policy_control
β”‚Β Β  β”œβ”€β”€ monte_carlo_off_policy_prediction
β”‚Β Β  β”œβ”€β”€ monte_carlo_state_aggregation
β”‚Β Β  └── monte_carlo_tree_search
└── TemporalDifference
    β”œβ”€β”€ temporal_difference_dyna_Q
    β”œβ”€β”€ temporal_difference_expected_sarsa
    β”œβ”€β”€ temporal_difference_n_step_sarsa
    β”œβ”€β”€ temporal_difference_off_policy_Q_learning
    β”œβ”€β”€ temporal_difference_on_policy_sarsa
    β”œβ”€β”€ temporal_difference_state_aggregation
    └── temporal_difference_zero

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •