p-Spectral Clustering Based on Neighborhood Attribute Granulation
Author(s) -
Shifei Ding,
Hongjie Jia,
Mingjing Du,
Qiankun Hu
Publication year - 2016
Publication title -
ifip advances in information and communication technology
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.189
H-Index - 53
eISSN - 1868-422X
pISSN - 1868-4238
DOI - 10.1007/978-3-319-48390-0_6
Subject(s) - cluster analysis , data mining , cure data clustering algorithm , correlation clustering , spectral clustering , mathematics , pattern recognition (psychology) , canopy clustering algorithm , computer science , data stream clustering , clustering high dimensional data , data point , artificial intelligence
Part 2: Data MiningInternational audienceClustering analysis is an important method for data mining and information statistics. Data clustering is to find the intrinsic links between objects and describe the internal structures of data sets. p-Spectral clustering is based on Cheeger cut criterion. It has good performance on many challenging data sets. But the original p-spectral clustering algorithm is not suitable for high-dimensional data. To solve this problem, this paper improves p-spectral clustering using neighborhood attribute granulation and proposes NAG-pSC algorithm. Neighborhood rough sets can directly process the continuous data. We introduce information entropy into the neighborhood rough sets to weaken the negative impact of noise data and redundant attributes on clustering. In this way, the data points within the same cluster are more compact, while the data points between different clusters are more separate. The effectiveness of the proposed NAG-pSC algorithm is tested on several benchmark data sets. Experiments show that the neighborhood attribute granulation will highlight the differences between data points while maintaining their characteristics in the clustering. With the help of neighborhood attribute granulation, NAG-pSC is able to recognize more complex data structures and has strong robustness to the noise or irrelevant features in high-dimensional data
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom