-
-
word2vec Public
word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch
-
tgnews Public
Telegram Data Clustering Contest (Bossy Gnu's submission )
-
-
pg_mystem Public
pg_mystem - расширение PostgreSQL для лемматизации (морфологической нормализации) текстов на русском языке. PostgreSQL extension for Yandex Mystem
-
machine-learning-yearning Public
Forked from ajaymache/machine-learning-yearningMachine Learning Yearning book by
🅰️ 𝓷𝓭𝓻𝓮𝔀 🆖 -
russian_news_corpus Public
Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ