Premium
Shrunken centroids regularized discriminant analysis as a promising strategy for metabolomics data exploration
Author(s) -
Chen Chen,
Zhang ZhiMin,
Ouyang MeiLan,
Liu Xinbo,
Yi Lunzhao,
Liang YiZeng,
Zhang ChaoPing
Publication year - 2015
Publication title -
journal of chemometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.47
H-Index - 92
eISSN - 1099-128X
pISSN - 0886-9383
DOI - 10.1002/cem.2685
Subject(s) - linear discriminant analysis , centroid , feature selection , optimal discriminant analysis , computer science , artificial intelligence , pattern recognition (psychology) , metabolomics , discriminant , estimator , data mining , machine learning , mathematics , statistics , bioinformatics , biology
Metabolomics datasets generated by modern analytical instruments tend to be increasingly complex. In this study, a recent method named shrunken centroids regularized discriminant analysis (SCRDA) has been introduced and applied in the exploration of metabolomics dataset. It is a supervised method for variable selection, discriminant analysis and biomarker screening. By regularizing the estimate of the within‐class covariance matrix, SCRDA can deal with the singularity issue of linear discriminant analysis. Then a shrinkage estimator is applied to perform variable selection. The method presented is illustrated through the simulated datasets and three complex metabolomics datasets. Commonly used orthogonal partial least squares discriminant analysis and two other similar statistical methods, penalized linear discriminant analysis and nearest shrunken centroids, are used for comparisons. The results illustrate that SCRDA has some desirable abilities in variable selection, classification and prediction. Moreover, the biomarkers identified by SCRDA are further demonstrated to be in accordance with the biochemical research. It has been proved that SCRDA can be applied as a promising strategy in metabolomics. Copyright © 2014 John Wiley & Sons, Ltd.