z-logo
open-access-imgOpen Access
Finding author similarity by clustering probabilistic LSA factors in INDIAN english authors poetry
Author(s) -
Kusum Kumar,
Venkata Naresh Mandhala,
Sudheshna Vempati,
Subba Rao Peram
Publication year - 2018
Publication title -
international journal of engineering and technology
Language(s) - English
Resource type - Journals
ISSN - 2227-524X
DOI - 10.14419/ijet.v7i2.7.12235
Subject(s) - probabilistic latent semantic analysis , cluster analysis , similarity (geometry) , artificial intelligence , computer science , semantic similarity , probabilistic logic , latent semantic analysis , word (group theory) , natural language processing , mathematics , geometry , image (mathematics)
High dimensionality and sparseness is the big challenge to the data scientists to discover the similarity among the documents. In unsuper-vised learning data is unlabeled and there is no clear distance measures to discover the clusters among the data. In this paper we considered Indian English Authors poems to cluster them using Probabilistic Latent Semantic Analysis, using which we analyzed the authors similarity. We compared the results of clustering with Latent Semantic Analysis method, a word occurrence method. In this case, Results are shown that probabilistic methods are performing good clustering than the word occurrence method.  

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here