z-logo
open-access-imgOpen Access
Latent Feature Word Representations to Enhance Topic Models for Text Mining Algorithms
Author(s) -
Dr Thayyaba Khatoon Mohammed,
M. Gayatri,
M. Sandeep,
D. Venkatarami Reddy
Publication year - 2019
Publication title -
international journal of engineering and advanced technology
Language(s) - English
Resource type - Journals
ISSN - 2249-8958
DOI - 10.35940/ijeat.b2503.129219
Subject(s) - latent dirichlet allocation , topic model , computer science , document clustering , cluster analysis , word2vec , exploit , artificial intelligence , leverage (statistics) , natural language processing , word (group theory) , generative grammar , document classification , probabilistic logic , feature (linguistics) , information retrieval , machine learning , linguistics , philosophy , computer security , embedding
Dealing with large number of textual documents needs proven models that leverage the efficiency in processing. Text mining needs such models to have meaningful approaches to extract latent features from document collection. Latent Dirichlet allocation (LDA) is one such probabilistic generative process model that helps in representing document collections in a systematic approach. In many text mining applications LDA is useful as it supports many models. One such model is known as Topic Model. However, topic models LDA needs to be improved in order to exploit latent feature vector representations of words trained on large corpora to improve word-topic mapping learnt on smaller corpus. With respect to document clustering and document classification, it is essential to have a novel topic models to improve performance. In this paper, an improved topic model is proposed and implemented using LDA which exploits the benefits of Word2Vec tool to have pre-trained word vectors so as to achieve the desired enhancement. A prototype application is built to demonstrate the proof of the concept with text mining operations like document clustering.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here