Skip to content

Latest commit

 

History

History
17 lines (11 loc) · 651 Bytes

README.md

File metadata and controls

17 lines (11 loc) · 651 Bytes

Xhosa NLP 🔤

GitHub license PRs Welcome

Quickstart

  1. Install NLTK
  2. Run python most_frequent_words.py
  3. Open results.csv to view results

Source of Corpus

Leipzig Corpora Collection

CITE: D. Goldhahn, T. Eckart & U. Quasthoff: Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages. In: Proceedings of the 8th International Language Ressources and Evaluation (LREC'12), 2012