@@ -19,23 +19,19 @@ In these tutorials for reinforcement learning, it covers from the basic RL algor
19
19
# Table of Contents
20
20
21
21
* [ Simple entry example] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/1_command_line_reinforcement_learning )
22
- * Tabular Methods
23
- * [ Q-learning] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze )
24
- * [ Sarsa] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze )
25
- * [ Sarsa(lambda)] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze )
26
- * Function Approximation (DQN)
27
- * [ Deep Q Network] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network )
22
+ * [ Q-learning] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze )
23
+ * [ Sarsa] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze )
24
+ * [ Sarsa(lambda)] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze )
25
+ * [ Deep Q Network] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network )
28
26
* [ Using OpenAI Gym] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/6_OpenAI_gym )
29
- * DQN-based methods
30
- * [ Double DQN] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN )
31
- * [ DQN with Prioitized Experience Replay] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN )
32
- * [ Dueling DQN] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN )
27
+ * [ Double DQN] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN )
28
+ * [ DQN with Prioitized Experience Replay] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN )
29
+ * [ Dueling DQN] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN )
33
30
* [ Policy Gradients] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/7_Policy_gradient_softmax )
34
31
* [ Actor Critic] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage )
35
- * [ Deep Deterministic Policy Gradient] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG )
36
- * [ A3C] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C )
37
- * Model-based RL (WIP)
38
- * [ Dyna-Q] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q )
32
+ * [ Deep Deterministic Policy Gradient] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG )
33
+ * [ A3C] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C )
34
+ * [ Dyna-Q] ( https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q )
39
35
40
36
41
37
# Donation
0 commit comments