Premium
Sparse models by iteratively reweighted feature scaling: a framework for wavelength and sample selection
Author(s) -
Andries Erik
Publication year - 2013
Publication title -
journal of chemometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.47
H-Index - 92
eISSN - 1099-128X
pISSN - 0886-9383
DOI - 10.1002/cem.2492
Subject(s) - chemometrics , calibration , multivariate statistics , computer science , feature selection , sparse approximation , principal component analysis , partial least squares regression , selection (genetic algorithm) , model selection , multidimensional scaling , feature (linguistics) , sparse pca , pattern recognition (psychology) , artificial intelligence , covariate , algorithm , mathematics , machine learning , statistics , linguistics , philosophy
In the past decade, there has been an increase in the use of sparse multivariate calibration methods in chemometrics. Sparsity describes a parsimonious state of model complexity and can be defined in terms of a subset of samples or covariates (e.g., wavelengths) that are used to define the calibration model. With respect to their classical counterparts such as principal component regression or partial least squares, sparse models are more easily interpretable and have been shown to exhibit non‐inferior prediction performance. However, sparse methods are still not as fast as the classical methods in spite of recent numerical advances. In addition, for many chemometricians, sparse methods are still “black‐box” algorithms whose internal workings are not well understood. In this paper, we describe a simple framework whereby classical multivariate calibration methods can be iteratively used to generate sparse models. Moreover, this approach allows for either wavelength or sample sparsity. We demonstrate the effectiveness of this approach on two spectroscopic data sets. Copyright © 2013 John Wiley & Sons, Ltd.