Chemometrics and Experimental Design for the Quantification of Nitrate Salts in Nitric Acid: Near-Infrared Spectroscopy Absorption Analysis
Author(s) -
Luke R. Sadergaski,
Gretchen Toney,
Lætitia H. Delmau,
Kristian Myhre
Publication year - 2021
Publication title -
applied spectroscopy
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.415
H-Index - 110
eISSN - 1943-3530
pISSN - 0003-7028
DOI - 10.1177/0003702820987281
Subject(s) - chemometrics , partial least squares regression , calibration , set (abstract data type) , data set , design of experiments , computer science , principal component analysis , multivariate statistics , nitric acid , linear regression , biological system , chemistry , artificial intelligence , statistics , machine learning , mathematics , inorganic chemistry , programming language , biology
Implementing remote, real-time spectroscopic monitoring of radiochemical processing streams in hot cell environments requires efficiency and simplicity. The success of optical spectroscopy for the quantification of species in chemical systems highly depends on representative training sets and suitable validation sets. Selecting a training set (i.e., calibration standards) to build multivariate regression models is both time- and resource-consuming using standard one-factor-at-a-time approaches. This study describes the use of experimental design to generate spectral training sets and a validation set for the quantification of sodium nitrate (0-1 M) and nitric acid (0.1-10 M) using the near-infrared water band centered at 1440 nm. Partial least squares regression models were built from training sets generated by both D- and I-optimal experimental designs and a one-factor-at-a-time approach. The prediction performance of each model was evaluated by comparing the bias and standard error of prediction for statistical significance. D- and I-optimal designs reduced the number of samples required to build regression models compared with one-factor-at-a-time while also improving performance. Models must be confirmed against a validation sample set when minimizing the number of samples in the training set. The D-optimal design performed the best when considering both performance and efficiency by improving predictive capability and reducing number of samples in the training set by 64% compared with the one-factor-at-a-time approach. The experimental design approach objectively selects calibration and validation spectral data sets based on statistical criterion to optimize performance and minimize resources.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom