z-logo
open-access-imgOpen Access
Comparison Analysis: Large Data Classification Using PLS-DA and Decision Trees
Author(s) -
Nurazlina Abdul Rashid,
Norashikin Nasaruddin,
K. D. A. Kassim,
Amirah Hazwani Abdul Rahim
Publication year - 2020
Publication title -
mathematics and statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.149
H-Index - 3
eISSN - 2332-2144
pISSN - 2332-2071
DOI - 10.13189/ms.2020.080205
Subject(s) - mathematics , decision tree , statistics , pattern recognition (psychology) , artificial intelligence , computer science
Classification studies are widely applied in many areas of research. In our study, we are using classification analysis to explore approaches for tackling the classification problem for a large number of measures using partial least square discriminant analysis (PLS-DA) and decision trees (DT). The performance for both methods was compared using a sample data of breast tissues from the University of Wisconsin Hospital. A partial least square discriminant analysis (PLS-DA) and decision trees (DT) predict the diagnosis of breast tissues (M = malignant, B = benign). A total of 699 patients diagnose (458 benign and 241 malignant) are used in this study. The performance of PLS-DA and DT has been evaluated based on the misclassification error and accuracy rate. The results show PLS-DA can be considered as a good and reliable technique to be used when dealing with a large dataset for the classification task and have good prediction accuracy.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom