Data repository for pretrained NLP models and NLP corpora.
Updated Mar 16, 2018 - Python
A tool that suggests GitHub repositories based on the repositories you have shown interest in.
Data Science algorithms and topics that you must know. (Newly Designed) Recommender Systems, Decision Trees, K-Means, LDA, RFM-Segmentation, XGBoost in Python, R, and Scala.
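A public opinion analysis project on text data from the Weibo platform, including a Weibo crawler, LDA topic analysis, and sentiment analysis.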
Using latent Dirichlet allocation (LDA) in Apache Lucene
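Processes and analyzes comment data from the Autohome (汽车之家) forum: derives user behavior features from latent user behavior data, topic features of user comments from an LDA topic model, and text content features from a Word2Vec word-embedding model; uses K-Means clustering to obtain the categories of paid-poster ("water army") text; and combines these with the user behavior features to identify online paid posters.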
Code to run the LDA algorithm on data scraped from Twitter/Foursquare.
This Python project develops an LDA model that trains on various Wikipedia articles based on a keyword and then suggests Wikipedia articles based on a search query.
REST web service to compute and query Latent Dirichlet Allocation models
A consolidated collection of topic model implementations
Drake Analysis: a deeper look into the discography of Canada's Rap King using various NLP techniques.
Extreme Extractive Text Summarization and Topic Modeling (using LSA and LDA techniques) over Reddit Posts from TLDRHQ dataset.
An NLP project that compares different approaches to document representation and classification. The techniques used include topic modeling, TF-IDF, doc2vec, SVM, and CNN.
NLP of Ted Talk transcripts and a recommender.
Predict shop categories by topic modeling with latent Dirichlet allocation and gensim
Uses a Word2vec model and an LDA model for drug recommendation