Automatic fast double  KNN  classification algorithm based on  ACC  and hierarchical clustering for big data | Zendy

Li Haiyun | Zendy; Li Haifeng | Zendy; Wei Kaibin | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Automatic fast double KNN classification algorithm based on ACC and hierarchical clustering for big data

Author(s) -

Li Haiyun,

Li Haifeng,

Wei Kaibin

Publication year - 2018

Publication title -

international journal of communication systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.344

H-Index - 49

eISSN - 1099-1131

pISSN - 1074-5351

DOI - 10.1002/dac.3488

Subject(s) - computer science , cluster analysis , k nearest neighbors algorithm , classifier (uml) , data mining , artificial intelligence , pattern recognition (psychology) , hierarchical clustering , big data , machine learning

Summary In data mining, k ‐nearest neighbors (KNN) classifier is an efficient lazy learning yet simple widely renowned method, which has been widely used in many actual applications, successfully. Because of time and memory restrictions of KNN, when KNN is tested in large‐scale datasets, the classification accuracy is very low. Therefore, we propose an automatic fast double KNN classification algorithm on the basis of automatically determining the cluster centers and hierarchical clustering. We introduce automatically determining the cluster centers into the KNN in training process. Namely, big data samples are divided into several parts depending on our clustering methods. Afterwards, the clusters nearest to testing samples are excavated as the new training samples in the testing process. Each of the new samples is then conducted with hierarchical clustering. In this way, computation and time complexity are greatly reduced. Finally, experiments results conducted on big data show that new KNN classification method can significantly raise the accuracy and efficiency of automatic classification than other state‐of‐the‐art KNN classification algorithms.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research