An implementation of the Normalized Advantage Function Reinforcement Learning Algorithm with Prioritized Experience Replay
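NAF makes Q-learning tractable for continuous actions by restricting the advantage to a quadratic function of the action, so the greedy action is simply the network's predicted mean. Below is a minimal NumPy sketch of that decomposition, Q(s, a) = V(s) + A(s, a) with A(s, a) = -1/2 (a - mu(s))^T P(s) (a - mu(s)); the function and variable names are illustrative, not taken from this repo.

```python
import numpy as np

def naf_q_value(state_value, mu, L, action):
    """Sketch of the NAF decomposition Q(s, a) = V(s) + A(s, a).

    state_value: V(s), scalar output of the value head
    mu:          mu(s), greedy action predicted by the network
    L:           lower-triangular matrix output, used to form P(s) = L L^T
    action:      the action whose Q-value we want
    """
    P = L @ L.T                           # positive semi-definite matrix
    diff = action - mu
    advantage = -0.5 * diff @ P @ diff    # quadratic advantage, maximal at a = mu(s)
    return state_value + advantage

# Example with a 2-D action space: the greedy action a = mu(s) attains Q = V(s).
V = 1.3
mu = np.array([0.2, -0.5])
L = np.tril(np.array([[1.0, 0.0], [0.3, 0.8]]))
print(naf_q_value(V, mu, L, mu))                    # 1.3
print(naf_q_value(V, mu, L, np.array([0.9, 0.1])))  # smaller than 1.3
```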
- The original paper this code implements: https://arxiv.org/abs/1603.00748
- The code is mainly based on: https://github.com/carpedm20/NAF-tensorflow/
- Additionally, I added prioritized experience replay: https://arxiv.org/abs/1511.05952
- Using the OpenAI Baselines implementation (see the usage sketch below): https://github.com/openai/baselines/blob/master/baselines/deepq/replay_buffer.py
Thanks, OpenAI and Kim!
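For reference, the sketch below shows how the prioritized buffer from the linked baselines file is typically driven during training; the buffer size, the alpha/beta values, the dummy transitions, and the random TD errors are illustrative placeholders for the real environment and critic, not code from this repo.

```python
import numpy as np
from baselines.deepq.replay_buffer import PrioritizedReplayBuffer

# alpha controls how strongly the TD-error priorities skew the sampling distribution.
buffer = PrioritizedReplayBuffer(size=100_000, alpha=0.6)

# Store a few dummy transitions (obs, action, reward, next_obs, done).
for _ in range(100):
    obs, next_obs = np.random.randn(3), np.random.randn(3)
    action, reward, done = np.random.randn(1), np.random.rand(), 0.0
    buffer.add(obs, action, reward, next_obs, done)

# Sample with importance-sampling weights; beta is usually annealed towards 1.0.
obs_b, act_b, rew_b, next_obs_b, done_b, weights, idxes = buffer.sample(32, beta=0.4)

# In the real training loop the new priorities come from the critic's TD errors;
# random values stand in for them here.
td_errors = np.random.randn(32)
buffer.update_priorities(idxes, np.abs(td_errors) + 1e-6)
```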
- Normalizing the state and action spaces, as well as the reward, is good practice (see the sketch after this list)
- Visualise as much as possible to get an intuition about the method and to spot possible bugs
- If something does not make sense, it is a bug with very high probability
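As a concrete example of the normalization tip above, here is a minimal sketch (not code from this repo) that rescales observations with running statistics and maps actions from the network's [-1, 1] output range to the environment's bounds; the class and helper names are illustrative.

```python
import numpy as np

class RunningNormalizer:
    """Keeps a running mean/variance of observations and rescales them to roughly zero mean, unit variance."""
    def __init__(self, shape, eps=1e-8):
        self.mean = np.zeros(shape)
        self.var = np.ones(shape)
        self.count = eps

    def update(self, x):
        # Combine the running statistics with a new batch of observations.
        batch_mean, batch_var, n = x.mean(axis=0), x.var(axis=0), x.shape[0]
        delta = batch_mean - self.mean
        total = self.count + n
        self.mean = self.mean + delta * n / total
        self.var = (self.var * self.count + batch_var * n
                    + delta ** 2 * self.count * n / total) / total
        self.count = total

    def __call__(self, x):
        return (x - self.mean) / np.sqrt(self.var + 1e-8)

def scale_action(a, low, high):
    """Map an action in [-1, 1] (network output) to the environment's [low, high] range."""
    return low + 0.5 * (a + 1.0) * (high - low)

norm = RunningNormalizer(shape=(3,))
norm.update(np.random.randn(64, 3) * 5.0 + 2.0)
print(norm(np.array([[2.0, 2.0, 2.0]])))          # roughly zero after normalization
print(scale_action(np.array([0.0]), -2.0, 2.0))   # -> [0.]
```

Rewards can be handled the same way, e.g. by dividing by a fixed scale so the returns stay in a small range.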
Coding makes me happy 🙃