Use the training `end_e` as the `evaluation(..., epsilon=end_e)` for atari #879
Job | Run time |
---|---|
 | 2m 11s |
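
The change named in the title can be illustrated with a minimal sketch. It assumes an epsilon-greedy agent whose exploration rate decays linearly to `end_e` during training; the helper names `linear_schedule` and `epsilon_greedy`, and all numeric values, are hypothetical and are not taken from the PR itself. The point is only that evaluation reuses the final training epsilon instead of a separately hard-coded value.

```python
import random


def linear_schedule(start_e, end_e, duration, t):
    """Linearly decay epsilon from start_e to end_e over `duration` steps (hypothetical helper)."""
    slope = (end_e - start_e) / duration
    return max(slope * t + start_e, end_e)


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise the greedy action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


# Assumed values for illustration only.
start_e, end_e = 1.0, 0.01
duration, total_steps = 100_000, 1_000_000

# Epsilon the agent ends training with.
train_epsilon_final = linear_schedule(start_e, end_e, duration, total_steps)

# Evaluate with the same epsilon the agent finished training on.
eval_epsilon = end_e

rng = random.Random(0)
action = epsilon_greedy([0.1, 0.7, 0.2], eval_epsilon, rng)
```

Passing `end_e` through to evaluation keeps the evaluated policy consistent with the policy at the end of training, so a change to the training argument does not silently leave evaluation on a different exploration rate.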