
Transfer learning and wavelength selection method in NIR spectroscopy to predict glucose and lactate concentrations in culture media using VIP‐Boruta
Author(s) -
Kaneko Hiromasa,
Kono Shunsuke,
Nojima Akihiro,
Kambayashi Takuya
Publication year - 2021
Publication title -
analytical science advances
Language(s) - English
Resource type - Journals
ISSN - 2628-5452
DOI - 10.1002/ansa.202000177
Subject(s) - partial least squares regression , selection (genetic algorithm) , artificial intelligence , regression analysis , feature selection , computer science , biological system , mathematics , machine learning , biology
Regression models are constructed to predict glucose and lactate concentrations from near‐infrared spectra in culture media. The partial least‐squares (PLS) regression technique is employed, and we investigate the improvement in the predictive ability of PLS models that can be achieved using wavelength selection and transfer learning. We combine Boruta, a nonlinear variable selection method based on random forests, with variable importance in projection (VIP) in PLS to produce the proposed variable selection method, VIP‐Boruta. Furthermore, focusing on the situation where both culture medium samples and pseudo‐culture medium samples can be used, we transfer pseudo media to culture media. Data analysis with an actual dataset of culture media and pseudo media confirms that VIP‐Boruta can effectively select appropriate wavelengths and improves the prediction ability of PLS models, and that transfer learning with pseudo media enhances the predictive ability. The proposed method could reduce the prediction errors by about 61% for glucose and about 16% for lactate, compared to the traditional PLS model.