An implementation of [Dynamic Coattention Networks for Question Answering](https://arxiv.org/pdf/1611.01604.pdf).
Our best model achieved an F1 of 73.5% and an EM of 63.4% after 9 epochs, compared to the paper's reported F1 of 75.6% and EM of 65.4%.
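For reference, EM and F1 here are the standard SQuAD metrics, computed per question and averaged. A minimal sketch of both (simplified to lowercasing and whitespace tokenization; the official evaluation script additionally strips articles and punctuation):

```python
from collections import Counter

def exact_match(prediction: str, truth: str) -> bool:
    # Exact match after simple normalization (lowercase, collapse whitespace).
    return " ".join(prediction.lower().split()) == " ".join(truth.lower().split())

def f1_score(prediction: str, truth: str) -> float:
    # Token-level F1: harmonic mean of precision and recall over shared tokens.
    pred_tokens = prediction.lower().split()
    true_tokens = truth.lower().split()
    overlap = sum((Counter(pred_tokens) & Counter(true_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(true_tokens)
    return 2 * precision * recall / (precision + recall)
```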
The file `network/config.py` supplies the configuration for most aspects of the model. A sample is shown below.
```python
class Config:
    def __init__(self):
        self.BATCH_SIZE = 64
        self.EMBEDDING_DIMENSION = 300
        self.MAX_EPOCHS = 100
        self.HIDDEN_UNIT_SIZE = 200
        self.POOL_SIZE = 4
        self.LEARNING_RATE = 0.001
        self.CLIP_NORM = 3.0
        self.DROPOUT_KEEP_PROB = 0.7
        self.TRAIN_PERCENTAGE = 0.90
        self.QUESTION_FILE = 'data/train.json'
        self.QUESTION_FILE_V2 = 'data/train-v2.0.json'
        self.EMBEDDING_FILE = 'data/glove.840B.300d.txt'
        self.MAX_CONTEXT_LENGTH = 632
        self.MAX_QUESTION_LENGTH = 40

CONFIG = Config()
```
Run `python train/train.py` to train the model. This loads cached word embeddings from the `/data/` folder, or generates and saves them if they have not been generated before. You can supply additional parameters:

- `--regenerateEmbeddings` to force regenerating and resaving the cached word embeddings to disk
- `--noGPU` to disable running on the GPU
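The flag names above come from this README; how the script actually parses them is not shown, but a minimal sketch with `argparse` could look like this:

```python
import argparse

def parse_args(argv=None):
    # Hypothetical parser for the two boolean flags documented above.
    parser = argparse.ArgumentParser(description="Train the model.")
    parser.add_argument("--regenerateEmbeddings", action="store_true",
                        help="force regenerating and resaving cached word embeddings")
    parser.add_argument("--noGPU", action="store_true",
                        help="disable running on the GPU")
    return parser.parse_args(argv)
```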
Training automatically evaluates on a validation subset and logs loss, F1, and exact match statistics as CSV to the `/results/` folder. The model is also saved every epoch to the `/model/` folder.
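Per-epoch CSV logging of this kind can be sketched with the standard `csv` module (the file layout and column names here are assumptions, not the repository's actual schema):

```python
import csv
import os

def log_epoch_stats(path, epoch, loss, f1, em):
    # Append one row per epoch; write a header the first time the file is created.
    new_file = not os.path.exists(path)
    with open(path, "a", newline="") as fh:
        writer = csv.writer(fh)
        if new_file:
            writer.writerow(["epoch", "loss", "f1", "exact_match"])
        writer.writerow([epoch, loss, f1, em])
```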
To evaluate F1/EM performance on the dev set, run `test/test_saved_model.py`.
Use `knowledgebase.py` to ask arbitrary questions. A context is automatically retrieved from Wikipedia and an answer is extracted from it.
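Retrieval of this kind can be sketched against the public MediaWiki search API; the function names and the choice to use the top hit's snippet as context are illustrative assumptions, not the repository's actual code:

```python
import json
import urllib.parse
import urllib.request

API = "https://en.wikipedia.org/w/api.php"

def build_search_url(question: str) -> str:
    # Build a MediaWiki full-text search URL for the question.
    params = {
        "action": "query",
        "list": "search",
        "srsearch": question,
        "format": "json",
    }
    return API + "?" + urllib.parse.urlencode(params)

def retrieve_context(question: str) -> str:
    # Fetch search results and return the top hit's snippet as a rough
    # context passage for the QA model (empty string if nothing matched).
    with urllib.request.urlopen(build_search_url(question)) as resp:
        data = json.load(resp)
    hits = data["query"]["search"]
    return hits[0]["snippet"] if hits else ""
```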
We attempted to adapt the method in the paper to SQuAD 2.0, which also contains unanswerable questions.

One approach was to keep the first row of U (corresponding to the sentinel) and use it to predict whether a question is unanswerable. To train this modified network, run `train/trainv2.py`. This achieved an EM of 35.5% and an F1 of 47.5% on the dev set.
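The sentinel idea can be illustrated with a small numpy sketch: score the sentinel alongside the context positions, and predict "unanswerable" whenever the sentinel wins. The shapes and names below are illustrative, not the network's actual code:

```python
import numpy as np

def is_answerable(span_logits: np.ndarray) -> bool:
    # span_logits: shape (L + 1,), where index 0 scores the sentinel
    # (the no-answer option) and indices 1..L score context positions.
    # The question is predicted answerable only when some context
    # position outscores the sentinel.
    return bool(int(np.argmax(span_logits)) != 0)
```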
Another approach is to train a separate classifier. For this we trained a classifier consisting of an encoder followed by a CNN classifier. To train it, run `train/train_classifier.py`. This classifier did not perform particularly well, achieving only 58% classification accuracy on the dev set (which contains an equal split of answerable and unanswerable questions).