Characteristics of Rough SetC-Means Clustering | Zendy

Seiki Ubukata | Zendy; Keisuke Umado | Zendy; Akira Notsu | Zendy; Katsuhiro Honda | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Characteristics of Rough SetC-Means Clustering

Author(s) -

Seiki Ubukata,

Keisuke Umado,

Akira Notsu,

Katsuhiro Honda

Publication year - 2018

Publication title -

journal of advanced computational intelligence and intelligent informatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.172

H-Index - 20

eISSN - 1343-0130

pISSN - 1883-8014

DOI - 10.20965/jaciii.2018.p0551

Subject(s) - rough set , cluster analysis , mathematics , set (abstract data type) , fuzzy set , fuzzy logic , extension (predicate logic) , artificial intelligence , computer science , pattern recognition (psychology) , fuzzy clustering , algorithm , programming language

Hard C -means (HCM), which is one of the most popular clustering techniques, has been extended by using soft computing approaches such as fuzzy theory and rough set theory. Fuzzy C -means (FCM) and rough C -means (RCM) are respectively fuzzy and rough set extensions of HCM. RCM can detect the positive and the possible regions of clusters by using the lower and the upper areas which are respectively analogous to the lower and the upper approximations in rough set theory. RCM-type methods have the problem that the original definitions of the lower and the upper approximations are not actually used. In this paper, rough set C -means (RSCM), which is an extension of HCM based on the original rough set definition, is proposed as a rough set-based counterpart of RCM. Specifically, RSCM is proposed as a clustering model on an approximation space considering a space granulated by a binary relation and uses the lower and the upper approximations of temporal clusters. For this study, we investigated the characteristics of the proposed RSCM through basic considerations, visual demonstrations, and comparative experiments. We observed the geometric characteristics of the examined methods by using visualizations and numerical experiments conducted for the problem of classifying patients as having benign or malignant disease based on a medical dataset. We compared the classification performance by viewing the trade-off between the classification accuracy in the positive region and the fraction of objects classified as being in the positive region.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research