Skip to content

In this I have used various text classification techniques in CLINC_OOS and BANKING77 datasets

License

Notifications You must be signed in to change notification settings

ianuragbhatt/clinc_oos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CLINC_OSS and BANKING77

Overview

  • In this repo I have used various text classification techniques on CLINC_OOS and BANKING77 datasets.
  • The link of datasets are given in their sections.
  • The goal is to get in ranking in PapersWithCode clinc_oos text_classification leaderboard.
  • This repo is divided into two directories named clinc_oos and Banking77
  • You can check their respective sections with their overview and accuracy table.

Overview

  • These notebooks have different techniques which I used to do intent classification in clinc_oss dataset.
  • The data folder have dataset which is used to create these notebooks.

Context

  • We have 150 intents in the dataset which we have to classify. And those sentences which don't fall on any intents we have to classify it as Out of Scope(oos). So, we have 150 + 1 = 151 intents. I have created multiple notebooks which have classification of intents without Out of Scope(oos) and with oos. You can read the descriptions of notebooks on below table. And the results of notebooks in Results section

Notebooks Description

Notebook Description
1_clinc_oos_RNN_CNN_Glove_no_oos RNN+CNN using Glove Embeddings with no oos intent.
2_clinc_oos_RNN_CNN_Glove_with_oos RNN+CNN using Glove Embeddings with oos intent.
3_clinc_oss_BERT_with_oos RNN+CNN with BERT Embeddings with oos intent.
4_clinc_oss_distilBERT_with_oos Huggingface DistilBERT model with oos intent.

Results

Below table have Accuracy Score(ACC), Adjusted Random Score(ARS), Normalized Mutual Info(NMI) which I got on CLINC_OOS Dataset.

Note: these results are on validation dataset not on test

Notebooks ACC ARS NMI
1_clinc_oos_RNN_CNN_Glove_no_oos 91.53 83.58 93.85
2_clinc_oos_RNN_CNN_Glove_with_oos 87.97 73.19 91.39
3_clinc_oss_BERT_with_oos 0 0 0
4_clinc_oss_distilBERT_with_oos 94 84.6 95.51

Overview

  • These notebooks have different techniques which I used to do text classification in banking77 dataset.
  • dataset has loaded from hugging face load_dataset lib.

Context

  • In this banking77 dataset we have 77 intents or labels which we have to predict

Notebooks Description

Notebook Description
1_banking77_RNN_CNN_Glove.ipynb RNN+CNN using Glove Embeddings.
2_banking77_RNN_CNN_fastText.ipynb RNN+CNN using Glove and GRU.

Results

Notebooks ACC
1_banking77_RNN_CNN_Glove.ipynb 87.95
2_banking77_RNN_CNN_fastText.ipynb 88.28

Contribution

Contributions are welcome! For bug reports or requests please submit an issue.

Contact-Info

Feel free to contact me to discuss any issues, questions, or comments.

About

In this I have used various text classification techniques in CLINC_OOS and BANKING77 datasets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published