Topic modeling on large data: 1.85M tweets written in Spanish (~1M geolocated in Spain) about 'coronavirus', posted between 2019 and 2020-04-20. Forked from ShuaiW/twitter-analysis (adapted for Python 3 to use a discriminative score), mainly for Twitter LDA (latent Dirichlet allocation using Gibbs sampling, https://lda.readthedocs.io/).
Updated Oct 10, 2021 - Python
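
To make "latent Dirichlet allocation using Gibbs sampling" more concrete, here is a minimal pure-Python sketch of a collapsed Gibbs sampler on a toy corpus. It is not the code from this repository or from the `lda` package; the function name `lda_gibbs`, the toy vocabulary, the documents, and the hyperparameter values (`alpha`, `eta`, `n_iter`) are all assumptions chosen for illustration.

```python
import random

def lda_gibbs(docs, n_topics, vocab_size, n_iter=200, alpha=0.1, eta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; docs are lists of integer word ids.

    Hypothetical helper for illustration only.
    Returns (doc_topic_counts, topic_word_counts).
    """
    rng = random.Random(seed)
    ndk = [[0] * n_topics for _ in docs]               # tokens in doc d assigned to topic k
    nkw = [[0] * vocab_size for _ in range(n_topics)]  # times word w is assigned to topic k
    nk = [0] * n_topics                                # total tokens assigned to topic k
    z = []                                             # current topic of every token

    # Start from a random topic assignment for each token.
    for d, doc in enumerate(docs):
        z.append([])
        for w in doc:
            k = rng.randrange(n_topics)
            z[d].append(k)
            ndk[d][k] += 1
            nkw[k][w] += 1
            nk[k] += 1

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = z[d][i]
                ndk[d][k] -= 1
                nkw[k][w] -= 1
                nk[k] -= 1
                # Unnormalized conditional p(z = t | all other assignments).
                weights = [
                    (ndk[d][t] + alpha) * (nkw[t][w] + eta) / (nk[t] + vocab_size * eta)
                    for t in range(n_topics)
                ]
                # Resample the topic and restore the counts.
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k
                ndk[d][k] += 1
                nkw[k][w] += 1
                nk[k] += 1
    return ndk, nkw

# Toy corpus (assumed data, not taken from the tweet collection).
vocab = ["virus", "hospital", "vaccine", "economy", "jobs", "market"]
docs = [
    [0, 1, 2, 0, 1],
    [3, 4, 5, 3, 4],
    [0, 2, 1, 2],
    [5, 4, 3, 5],
]
doc_topic, topic_word = lda_gibbs(docs, n_topics=2, vocab_size=len(vocab))

# Print the three most frequent words per topic.
for k, counts in enumerate(topic_word):
    top = sorted(range(len(vocab)), key=lambda w: counts[w], reverse=True)[:3]
    print(k, [vocab[w] for w in top])
```

On a corpus of millions of tweets a pure-Python loop like this would be far too slow; it only shows the sampling step that an optimized implementation performs.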