Topic modeling on a large dataset (1.85M tweets written in Spanish, ~1M of them geolocated in Spain, about 'coronavirus', from 2019 to 2020-04-20). Forked from ShuaiW/twitter-analysis (adapted for Python 3 to use a discriminative score), mainly for Twitter LDA (Latent Dirichlet Allocation using Gibbs sampling, https://lda.readthedocs.io/).
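
To illustrate what "LDA using Gibbs sampling" means in the description above, here is a minimal pure-Python sketch of a collapsed Gibbs sampler on a toy corpus. It is not the code of this repository or of the `lda` package, which uses an optimized implementation; the function name `gibbs_lda`, the hyperparameter values, and the six-word vocabulary are all hypothetical and chosen only for illustration.

```python
import random

def gibbs_lda(docs, n_topics, vocab_size, n_iter=200, alpha=0.1, eta=0.01, seed=0):
    """Toy collapsed Gibbs sampler for LDA; docs are lists of word ids."""
    rng = random.Random(seed)
    ndk = [[0] * n_topics for _ in docs]               # document-topic counts
    nkw = [[0] * vocab_size for _ in range(n_topics)]  # topic-word counts
    nk = [0] * n_topics                                # tokens assigned to each topic
    z = []                                             # topic assignment of every token

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zd = []
        for w in doc:
            k = rng.randrange(n_topics)
            zd.append(k)
            ndk[d][k] += 1
            nkw[k][w] += 1
            nk[k] += 1
        z.append(zd)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = z[d][i]
                ndk[d][k] -= 1
                nkw[k][w] -= 1
                nk[k] -= 1
                # p(z = t | everything else) is proportional to
                # (n_dt + alpha) * (n_tw + eta) / (n_t + V * eta)
                weights = [
                    (ndk[d][t] + alpha) * (nkw[t][w] + eta) / (nk[t] + vocab_size * eta)
                    for t in range(n_topics)
                ]
                # Resample the topic and add the token back.
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k
                ndk[d][k] += 1
                nkw[k][w] += 1
                nk[k] += 1
    return ndk, nkw

# Hypothetical toy vocabulary and corpus (word ids index into vocab).
vocab = ["virus", "covid", "hospital", "gol", "liga", "partido"]
docs = [
    [0, 1, 2, 0, 1],
    [1, 2, 0, 2],
    [3, 4, 5, 3],
    [4, 5, 3, 5, 4],
]
doc_topic, topic_word = gibbs_lda(docs, n_topics=2, vocab_size=len(vocab))

# Show the three most frequent words of each topic.
for k, row in enumerate(topic_word):
    top = sorted(range(len(vocab)), key=lambda w: -row[w])[:3]
    print(k, [vocab[w] for w in top])
```

Each sweep removes one token from the count tables, resamples its topic from the conditional distribution, and adds it back. On a corpus of millions of tweets, a compiled implementation such as the one in the linked `lda` package would be the practical choice.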
Updated Jul 31, 2025 - Python