A Novel Fuzzy based Clustering Algorithm for Text Classification
Author(s) -
A. KrishnaMohan,
Vidyadhar Rao,
MHM Krishna Prasad
Publication year - 2012
Publication title -
international journal of computer applications
Language(s) - English
Resource type - Journals
ISSN - 0975-8887
DOI - 10.5120/7211-9998
Subject(s) - computer science , cluster analysis , fuzzy clustering , data mining , artificial intelligence , pattern recognition (psychology) , feature (linguistics) , clustering high dimensional data , fuzzy logic , machine learning , philosophy , linguistics
Due to the flourish of World Wide Web and the rapid development of the Internet technology, the increasing volume of digital textual data become more and more unmanageable, therefore the importance of text classification has gained significant attention. Text classification pose some specific challenges such as high dimensionality with each document (data point) having only a very small subset of them and representing multiple labels at the same time. Feature clustering is a powerful method to reduce the dimensionality of feature vectors for text classification. Many researchers worked on Feature Clustering for efficient text classification. Recently a Fuzzy based feature clustering was proposed in which Gaussian distribution is used for fuzzy membership function for clustering. But the problem of skewness may occur with this distribution. To overcome that we propose an efficient Fuzzy similarity based membership function for efficient clustering and with this proposed algorithm satisfactory results obtained.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom