
Exploring popular topic models
Author(s) -
Poonam Tijare,
Pooja Rani
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1706/1/012171
Subject(s) - perplexity , topic model , computer science , task (project management) , data science , social media , big data , unstructured data , coherence (philosophical gambling strategy) , artificial intelligence , data mining , information retrieval , world wide web , language model , engineering , systems engineering , physics , quantum mechanics
Information from micro-blogging site such as Twitter is a huge repository of data. A lot of research is happening on sentiments, discovering patterns and prediction. One challenging task is dividing this humongous unstructured data into clusters. Several topic modeling methods are proposed by researchers. This paper presents a brief summary of topic modeling methods LDA, LSI and NMF and their applications. Experiments are conducted on the Twitter based datasets created using tweets on keywords Cauvery river, Lokpal bill and Rahul Gandhi. Paper covers a brief discussion on evaluating the accuracy of topics formed using perplexity, log-likelihood and topic coherence measures. Best topics formed are then fed to the Logistic regression model. The model created is showing better accuracy with LDA.