A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
-
Updated
Sep 25, 2024 - Python
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI) and Reading Comprehension Question-Answering (RCQA)
Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)
VarDial19 shared task: Discriminating between Mainland and Taiwan Variation of Mandarin Chinese (DMT)
A tool that predicts the dialect of English of an SMS message using recurrent neural networks supplemented with data from Google Trends.
using AraBert to classify different Arabic dialects. ranked fourth in WANLP2020 workshop.
log MFSC based classification of British English dialects from the IViE(Intonational Variation in English) corpus dataset
[Interspeech19] Computational Paralinguistics ChallengE (ComParE)
Add a description, image, and links to the dialect-identification topic page so that developers can more easily learn about it.
To associate your repository with the dialect-identification topic, visit your repo's landing page and select "manage topics."