Highlights
- Pro
Pinned Loading
-
IndoNLP/indonlu
IndoNLP/indonlu PublicThe first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
-
IndoNLP/nusa-crowd
IndoNLP/nusa-crowd PublicA collaborative project to collect datasets in Indonesian languages.
-
IndoNLP/nusax
IndoNLP/nusax PublicHigh-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)
-
IndoNLP/indonlg
IndoNLP/indonlg PublicThe first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained IndoGPT and IndoBART models, and a starter code!…
-
IndoNLP/nusa-writes
IndoNLP/nusa-writes PublicNusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.
-
SEACrowd/seacrowd-datahub
SEACrowd/seacrowd-datahub PublicA collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
If the problem persists, check the GitHub status page or contact support.