z-logo
Premium
Augmentation of a nearest neighbour clustering algorithm with a partial supervision strategy for biomedical data classification
Author(s) -
Salem Sameh A.,
Salem Nancy M.,
Nandi Asoke K.
Publication year - 2009
Publication title -
expert systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.365
H-Index - 38
eISSN - 1468-0394
pISSN - 0266-4720
DOI - 10.1111/j.1468-0394.2008.00502.x
Subject(s) - computer science , cluster analysis , data mining , scalability , robustness (evolution) , a priori and a posteriori , nearest neighbour , data set , pattern recognition (psychology) , set (abstract data type) , artificial intelligence , algorithm , database , biochemistry , chemistry , philosophy , epistemology , gene , programming language
In this paper, a partial supervision strategy for a recently developed clustering algorithm, the nearest neighbour clustering algorithm (NNCA), is proposed. The proposed method (NNCA‐PS) offers classification capability with a smaller amount of a priori knowledge, where a small number of data objects from the entire data set are used as labelled objects to guide the clustering process towards a better search space. Experimental results show that NNCA‐PS gives promising results of 89% sensitivity at 95% specificity when used to segment retinal blood vessels, and a maximum classification accuracy of 99.5% with 97.2% average accuracy when applied to a breast cancer data set. Comparisons with other methods indicate the robustness of the proposed method in classification. Additionally, experiments on parallel environments indicate the suitability and scalability of NNCA‐PS in handling larger data sets.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here