Open Access
Sparse Methods in Spectroscopy: An Introduction, Overview, and Perspective
Author(s) - Erik Andries, Shawn Martin
Publication year - 2013
Publication title - Applied Spectroscopy
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.415
H-Index - 110
eISSN - 1943-3530
pISSN - 0003-7028
DOI - 10.1366/13-07021
Subject(s) - calibration, computer science, model selection, perspective (graphical), partial least squares regression, multivariate statistics, chemometrics, linear model, selection (genetic algorithm), statistical model, algorithm, term (time), machine learning, feature selection, artificial intelligence, mathematics, statistics, physics, quantum mechanics
Multivariate calibration methods such as partial least-squares build calibration models that are not parsimonious: all variables (either wavelengths or samples) are used to define a calibration model. In high-dimensional or large sample size settings, interpretable analysis aims to reduce model complexity by finding a small subset of variables that significantly influences the model. The term “sparsity”, as used here, refers to calibration models having many zero-valued regression coefficients. Only the variables associated with non-zero coefficients influence the model. In this paper, we briefly review the regression problems associated with sparse models and discuss their spectroscopic applications. We also discuss how one can re-appropriate sparse modeling algorithms that perform wavelength selection for purposes of sample selection. In particular, we highlight specific sparse modeling algorithms that are easy to use and understand for the spectroscopist, as opposed to the overly complex “black-box” algorithms that dominate much of the statistical learning literature. We apply these sparse modeling approaches to three spectroscopic data sets.
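To make the notion of sparsity concrete, the following is a minimal sketch of one common sparse regression formulation, the L1-penalized least-squares problem (the lasso), solved by cyclic coordinate descent. The abstract does not name a specific algorithm, so the choice of the lasso, the function names (`soft_threshold`, `lasso_coordinate_descent`), the penalty value `lam`, and the synthetic "spectra" are all assumptions made for illustration and are not taken from the paper. The rows of `X` play the role of samples and the columns the role of wavelengths; only two wavelengths actually drive the response.

```python
import random

def soft_threshold(z, t):
    """Shrink z toward zero by t; anything within [-t, t] becomes exactly 0."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0

def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimize (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 one coefficient at a time."""
    n, p = len(X), len(X[0])
    b = [0.0] * p
    residual = list(y)  # equals y - X b, since b starts at zero
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # an all-zero column cannot influence the model
            # Correlation of column j with the residual that excludes column j's own fit.
            rho = sum(X[i][j] * (residual[i] + X[i][j] * b[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - b[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                b[j] = new_bj
    return b

# Hypothetical synthetic data: 60 samples, 20 wavelengths, 2 informative wavelengths.
random.seed(0)
n_samples, n_wavelengths = 60, 20
X = [[random.gauss(0.0, 1.0) for _ in range(n_wavelengths)] for _ in range(n_samples)]
y = [2.0 * row[3] - 1.5 * row[11] + random.gauss(0.0, 0.01) for row in X]

b = lasso_coordinate_descent(X, y, lam=0.1)
selected = [j for j, coef in enumerate(b) if coef != 0.0]
print("wavelength indices with non-zero coefficients:", selected)
```

Because the soft-thresholding step sets small coefficients to exactly zero, the wavelengths listed in `selected` are the only ones that influence the fitted model. Under the same assumptions, applying the routine to the transpose of `X` (so that samples become the columns) gives one possible way to picture the sample-selection use described in the abstract, although the paper's own procedure may differ.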