Skip to content

Pytorch implementation of baseline models of KQA Pro, a large-scale dataset of complex question answering over knowledge base.

License

Notifications You must be signed in to change notification settings

shijx12/KQAPro_Baselines

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KQA Pro Baselines

KQA Pro is a large-scale dataset of complex question answering over knowledge base, which provides strong supervision of SPARQL and program for each question. Here is its homepage website. This dataset is licensed under a Creative Commons Attribution-ShareAlike 4.0 International.

This repo implements several baselines for the dataset:

  • Blind GRU. It predicts the answer in terms of only the input question, ignoring the knowledge base. We use it to measure the dataset bias.
  • KVMemNN (Key-Value Memory Networks)
  • RGCN (Relational Graph Convolutional Networks)
  • SRN (Stepwise Relational Networks)
  • RNN seq2seq SPARQL parser
  • RNN seq2seq program parser
  • BART seq2seq SPARQL parser
  • BART seq2seq program parser

Instructions of how to run these models are described in their README files. Before trying them, you need to first download the dataset and unzip it into the folder ./dataset. The file tree should be like

.
+-- dataset
|   +-- kb.json
|   +-- train.json
|   +-- val.json
|   +-- test.json
+-- GRU
|   +-- preprocess.py
|   +-- train.py
|   +-- ...
+-- KVMemNN
+-- RGCN
...

About

Pytorch implementation of baseline models of KQA Pro, a large-scale dataset of complex question answering over knowledge base.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages