z-logo
Premium
Optimized preprocessing and machine learning for quantitative Raman spectroscopy in biology
Author(s) -
Storey Emily E.,
Helmy Amr S.
Publication year - 2019
Publication title -
journal of raman spectroscopy
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.748
H-Index - 110
eISSN - 1097-4555
pISSN - 0377-0486
DOI - 10.1002/jrs.5608
Subject(s) - preprocessor , raman spectroscopy , robustness (evolution) , computer science , medical diagnosis , artificial intelligence , machine learning , process (computing) , biological system , pattern recognition (psychology) , data mining , optics , biology , physics , medicine , biochemistry , pathology , gene , operating system
Abstract Raman spectroscopy's capability to provide meaningful composition predictions is heavily reliant on a preprocessing step to remove insignificant spectral variation. This is crucial in biofluid analysis. Widespread adoption of diagnostics using Raman requires a robust model that can withstand routine spectra discrepancies due to unavoidable variations such as age, diet, and medical background. A wealth of preprocessing methods are available, and it is often up to trial‐and‐error or user experience to select the method that gives the best results. This process can be incredibly time consuming and inconsistent for multiple operators. In this study, we detail a method to analyze the statistical variability within a set of training spectra and determine suitability to form a robust model. This allows us to selectively qualify or exclude a preprocessing method, predetermine robustness, and simultaneously identify the number of components that will form the best predictive model. We demonstrate the ability of this technique to improve predictive models of two artificial biological fluids. Raman spectroscopy is ideal for noninvasive, nondestructive analysis. Routine health monitoring that maximizes comfort is increasingly crucial, particularly in epidemic‐level diabetes diagnoses. High variability in spectra of biological samples can hinder Raman's adoption for these methods. Our technique allows the decision of optimal pretreatment method to be determined for the operator; model performance is no longer a function of user experience. We foresee this statistical technique being an instrumental element to widening the adoption of Raman as a monitoring tool in a field of biofluid analysis.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here