Movie Recommender System using AIRL(adverserial inverse reinforcement learning)

This is a PyTorch implementation of Adversarial Inverse Reinforcement Learning(AIRL)[1] and GAIL[2] based on DDPG[2] for mine master thesis. The implementation took the code references of [3]-[10]. Also thank you for Prof. Dr. Heinz Köppl[11] and supervisor[12] giving me this chance to implement my ideas as graduation work.

Setup

You can install Python liblaries using pip3 install -r requirements.txt.

Example

train forward model:

python3 train_rl.py

train inverse model:

python3 train_imitation.py

run online server: extract rswebsite.zip, install maven dependencies in pom.xml then run main function in RecServer. Open http://localhost:6016/ in browser.

Run deployed model via TensorFlow Serving(in windows), docker download link: https://www.docker.com/. Open docker and then:

docker run -t --rm -p 8501:8501  -v ".\rswebsite\src\main\resources\webroot\modeldata\rl:/models/recmodel"  -e MODEL_NAME=recmodel tensorflow/serving

Demo

References

[1] Fu, Justin, Katie Luo, and Sergey Levine. "Learning robust rewards with adversarial inverse reinforcement learning." arXiv preprint arXiv:1710.11248 (2017).
[2] Timothy P. Lillicrap et.al: "Continuous control with deep reinforcement learning".
[3] machine learning of mofan
[4] Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
[5] gail-airl-ppo.pytorch project by Toshiki Watanabe
[6] ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (template) in PyTorch.
[7] Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
[8] A Deep Learning Recommender System
[9] Deep Reinforcement Learning based Recommender System by Young_Painter_L,kyunghoon-jung and pasus
[10] practice on movielens using pytorch
[11] Prof. Dr. Heinz Köppl
[12] Matthias Schultheis
[13] Haarnoja, Tuomas, et al. "Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor." arXiv preprint arXiv:1801.01290 (2018).\

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.idea		.idea
__pycache__		__pycache__
data		data
images		images
model		model
rswebsite		rswebsite
wandb		wandb
weights		weights
README.md		README.md
airl.py		airl.py
algo.py		algo.py
base.py		base.py
buffer.py		buffer.py
collect_demo.py		collect_demo.py
ddpg_.py		ddpg_.py
demostration.npy		demostration.npy
disc.py		disc.py
embedding.ipynb		embedding.ipynb
embedding.py		embedding.py
env.py		env.py
evaluate.py		evaluate.py
gail.py		gail.py
network.py		network.py
network_.py		network_.py
ppo2.py		ppo2.py
requirements.txt		requirements.txt
ringbuffer.py		ringbuffer.py
save_model.py		save_model.py
state.py		state.py
td3.py		td3.py
test.py		test.py
train_imitation.py		train_imitation.py
train_rl.py		train_rl.py
trainer.py		trainer.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Movie Recommender System using AIRL(adverserial inverse reinforcement learning)

Setup

Example

Demo

References

About

Releases

Packages

Languages

haiyufirefish/AIRL

Folders and files

Latest commit

History

Repository files navigation

Movie Recommender System using AIRL(adverserial inverse reinforcement learning)

Setup

Example

Demo

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages