Starred repositories
The aim of this project is to provide a simple ETL that will extract Instagram comments data from Phantombuster, transform it and move to Amazon S3 Data Lake
Arabic Dialectal Offensive Language dataset from social media comments on news post from facebook, twitter and youtube platforms
Fine-tune BERT models to classify Arabic text by different dialects.
An Arabic Tweet Dialect Classifier
Dictionary app that allows you to look up Arabic words in transliteration
Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/
A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic dialect. This repository aims to provide students and research…
[EACL 2021] Self-training Pretrained LMs for Zero- and Few-shot Arabic Sequence Labeling
CLIP (Contrastive Language–Image Pre-training) for Italian
🌐 A Backend service template project developed with Python Flask
OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve access to justice in India. Legal NER is one the AI component…
This repo contains a series of tutorials and code examples highlighting different features of the OCI Data Science and AI services, along with a release vehicle for experimental programs.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer
مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (ANLP).
Braggi is a Python based Contextual Chatbot Framework, which hopes to integrate all the necessities for a great chatbot framework, to satisfy both enterprise and general audiences alike. Developmen…
This Repository contains the list of various Machine and Deep Learning related projects. Related code and data files are available inside this folder. One can go through these projects to implement…
Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743
This repository contains copies of the major repositories that contain Arabic QA dataset that follows the SQuAD format
Arabic Open Domain Question Answering System using Neural Reading Comprehension
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question ans…
Jupyter notebook that contains the workflow for cleaning scraped HTML sites for NLP in Python
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.