Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
reinforcement-learning
pytorch
knowledge-graph
policy-gradient
reward-shaping
action-dropout
multi-hop-reasoning
-
Updated
Oct 4, 2024 - Jupyter Notebook