Skip to content

Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection

Notifications You must be signed in to change notification settings

luciusssss/answer-sentence-selection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection

This is a course project for Web Data Mining. The task is to decide whether a sentence contains the answer to the questions. We use ESIM (Enhanced LSTM for Natural Language Inference) as our main model. Pretrained Chinese character embedding is adopted to faciliate character-level matching between questions and answers. We employ focal loss to address the unbalanced label. A PowerPoint slide is attached in which we further explain our method.

Requirement

  • Python (>= 3.6)
  • PyTorch (>= 1.0)
  • torchtext

Dataset

The dataset for this project is NLPCC DBQA 2016.

Result

MAP MRR
All-0 25.30 25.81
BERT 93.73 93.83
Ours 90.33 90.48

Reference

About

Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages