Performance evaluation and comparison of algorithms #108

muupan · 2017-06-10T08:33:46Z

It will be great to add performance evaluation and comparisons of algorithms available in ChainerRL.

ghost · 2017-06-10T09:22:04Z

Indeed, I like TRPO with exact second derivatives/Hessian-vector products :) Has nice theoretical properties

muupan · 2017-06-11T01:24:54Z

I agree TRPO is great, but supporting TRPO is off-topic on this issue.

https://github.com/openai/rllab and https://github.com/openai/baselines are doing such evaluation and comparison really well, so it's good to start from them.

ghost · 2017-06-11T01:43:12Z

There' also PyTorch implementations which I'm currently using, (once chainer has second derivatives it will be possible to port these over),

ChainerRL seems to be very promising as an alternative to the openai repos given.

muupan · 2017-07-31T21:00:31Z

muupan · 2017-08-02T22:27:04Z

Added DoubleDQN and PAL.

muupan · 2017-08-04T21:49:32Z

Added DQN with prioritized replay

muupan added enhancement performance labels Jun 11, 2017

muupan mentioned this issue Oct 23, 2017

Include the results of examples in the repository #156

Open

12 tasks

prabhatnagarajan added the prio:high label Nov 6, 2018

Provide feedback