Classification of questions into 5 classes. The five classes are as follows :
HUMfor questions about humansENTYfor questions about entitiesDESCfor questions asking you for a descriptionNUMfor questions where the answer is numericalLOCfor questions where the answer is a locationABBRfor questions asking about abbreviations
Concepts
- RNN
- LSTM
- Word Embeddings
- Multilayer and Bi-Directional RNNs
RESULTS
Trained for 5 epochs on Kaggle.
Test Accuracy of ~91%
Model is confused between Entity questions and questions about humans.
- PyTorch
- Torchtext (Used
torchtext.legacy) - Spacy
- You can pretrain the model
- You can use the predict_classes function to try your own questions.
Thanks to the author of this for this amazing repo and its explanation and help.