
JointBERT

(Unofficial) PyTorch implementation of JointBERT: BERT for Joint Intent Classification and Slot Filling (Chen et al., 2019)

Model Architecture

  • Predicts the intent and the slot labels simultaneously from a single BERT model (i.e., a joint model); see the sketch after this list
  • total_loss = intent_loss + slot_loss
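
A minimal sketch of the joint architecture, assuming the pooled [CLS] output feeds the intent head and the per-token outputs feed the slot head; the class and attribute names here are illustrative, not the repo's actual code:

import torch.nn as nn
from transformers import BertModel

class JointBERTSketch(nn.Module):
    # Illustrative only; names and details are assumptions, not the repo's implementation.
    def __init__(self, num_intent_labels, num_slot_labels, dropout=0.1):
        super(JointBERTSketch, self).__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        hidden_size = self.bert.config.hidden_size
        self.dropout = nn.Dropout(dropout)
        self.intent_classifier = nn.Linear(hidden_size, num_intent_labels)
        self.slot_classifier = nn.Linear(hidden_size, num_slot_labels)

    def forward(self, input_ids, attention_mask, intent_labels=None, slot_labels=None):
        outputs = self.bert(input_ids, attention_mask=attention_mask)
        sequence_output, pooled_output = outputs[0], outputs[1]
        # Intent head reads the pooled [CLS] vector; slot head reads every token state.
        intent_logits = self.intent_classifier(self.dropout(pooled_output))
        slot_logits = self.slot_classifier(self.dropout(sequence_output))

        if intent_labels is None or slot_labels is None:
            return intent_logits, slot_logits

        loss_fct = nn.CrossEntropyLoss()  # real code should also ignore padded slot positions
        intent_loss = loss_fct(intent_logits, intent_labels)
        slot_loss = loss_fct(slot_logits.view(-1, slot_logits.size(-1)), slot_labels.view(-1))
        total_loss = intent_loss + slot_loss  # the joint objective from above
        return total_loss, intent_logits, slot_logits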

Dependencies

  • python>=3.5
  • torch==1.1.0
  • transformers==2.2.1
  • scikit-learn>=0.20.0
  • seqeval>=0.0.12
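
One way to install the pinned versions with pip (a plain pip command; whether the repo ships a requirements file is not shown here):

$ pip3 install torch==1.1.0 transformers==2.2.1 "scikit-learn>=0.20.0" "seqeval>=0.0.12"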

Dataset

        Train   Dev   Test  Intent Labels  Slot Labels
ATIS    4,478   500   893   21             120
Snips   13,084  700   700   7              72
  • The label counts are based on the train set only.
  • A UNK label is added for intent and slot labels that appear only in the dev and test sets.
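
For illustration, each utterance carries one intent label plus one BIO slot tag per token. An ATIS-style example (hand-written here, not taken from the repo's data files):

Utterance: find flights from boston              to denver
Slots:     O    O       O    B-fromloc.city_name O  B-toloc.city_name
Intent:    atis_flight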

Usage

$ python3 main.py --task {task_name} \
                  --model_type {model_type} \
                  --model_dir {model_dir_name} \
                  --do_train --do_eval

# For ATIS
$ python3 main.py --task atis \
                  --model_type bert \
                  --model_dir atis_model \
                  --do_train --do_eval
# For Snips
$ python3 main.py --task snips \
                  --model_type bert \
                  --model_dir snips_model \
                  --do_train --do_eval

Results

Each model was trained for 5 epochs (no hyperparameter tuning).

                   Intent acc (%)  Slot F1 (%)
ATIS   BERT        97.87           95.46
ATIS   DistilBERT  97.54           94.89
ATIS   RoBERTa     97.64           94.94
Snips  BERT        98.29           96.05
Snips  DistilBERT  98.42           94.10
Snips  RoBERTa     98.14           94.60
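
Slot F1 is presumably the entity-level F1 from seqeval (a listed dependency), which scores whole BIO spans rather than individual tags; a minimal sketch:

from seqeval.metrics import f1_score

y_true = [["O", "B-fromloc.city_name", "O", "B-toloc.city_name"]]
y_pred = [["O", "B-fromloc.city_name", "O", "O"]]
print(f1_score(y_true, y_pred))  # ~0.67: precision 1/1, recall 1/2 on gold spans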

Updates

  • 2019/12/03: Add DistilBERT and RoBERTa

References

  • BERT for Joint Intent Classification and Slot Filling (Chen et al., 2019): https://arxiv.org/abs/1902.10909