Repository containing basic reinforcement learning algorithms implemented in Python.
Reinforcement Learning
├── DeepRL
│   ├── Actor_Critic
│   ├── Actor_Critic_Uni_model
│   ├── DQN
│   │   └── Vanilla_DQN
│   ├── Reinforce
│   ├── off_policy
│   └── on_policy
├── DynamicProgramming
│   ├── multi-arm-bandit
│   ├── policy_iteration
│   └── value_iteration
├── MonteCarlo
│   ├── monte_carlo_continous_env
│   ├── monte_carlo_epsilon_greedy_exploration
│   ├── monte_carlo_every_visit
│   ├── monte_carlo_exploring
│   ├── monte_carlo_first_visit
│   ├── monte_carlo_off_policy_control
│   ├── monte_carlo_off_policy_prediction
│   ├── monte_carlo_state_aggregation
│   └── monte_carlo_tree_search
└── TemporalDifference
    ├── temporal_difference_dyna_Q
    ├── temporal_difference_expected_sarsa
    ├── temporal_difference_n_step_sarsa
    ├── temporal_difference_off_policy_Q_learning
    ├── temporal_difference_on_policy_sarsa
    ├── temporal_difference_state_aggregation
    └── temporal_difference_zero