@@ -18,23 +18,23 @@ In these tutorials for reinforcement learning, it covers from the basic RL algor
18
18
19
19
# Table of Contents
20
20
21
- * [ Simple entry example ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/1_command_line_reinforcement_learning )
22
- * [ Q-learning ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze )
23
- * [ Sarsa ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze )
24
- * [ Sarsa(lambda) ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze )
25
- * [ Deep Q Network ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network )
26
- * [ Using OpenAI Gym ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/6_OpenAI_gym )
27
- * [ Double DQN ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN )
28
- * [ DQN with Prioitized Experience Replay ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN )
29
- * [ Dueling DQN] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN )
30
- * [ Policy Gradients ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/7_Policy_gradient_softmax )
31
- * [ Actor Critic ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage )
32
- * [ Deep Deterministic Policy Gradient ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG )
33
- * [ A3C ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C )
34
- * [ Dyna-Q ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q )
35
- * [ Proximal Policy Optimization (PPO) ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization )
36
-
37
- ### [ Some of my experients] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments )
21
+ * Tutorials
22
+ * [ Simple entry example ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/1_command_line_reinforcement_learning )
23
+ * [ Q-learning ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze )
24
+ * [ Sarsa] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze )
25
+ * [ Sarsa(lambda) ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze )
26
+ * [ Deep Q Network ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network )
27
+ * [ Using OpenAI Gym ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/6_OpenAI_gym )
28
+ * [ Double DQN ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN )
29
+ * [ DQN with Prioitized Experience Replay ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN )
30
+ * [ Dueling DQN ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN )
31
+ * [ Policy Gradients ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/7_Policy_gradient_softmax )
32
+ * [ Actor Critic ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage )
33
+ * [ Deep Deterministic Policy Gradient ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG )
34
+ * [ A3C ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C )
35
+ * [ Dyna-Q ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q )
36
+ * [ Proximal Policy Optimization (PPO) ] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization )
37
+ * [ Some of my experients] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments )
38
38
39
39
# Donation
40
40
0 commit comments