This is a self-learning note for reinforcement learning.
My reading is Reinforcement Learning: An Introduction which is written by Richard S.Sutton and Andrew G.Barto.
There are chinese notes on my hackmd, and only script on my GitHub.
I will move note to GitHubPage when someday I have time.