z-logo
Premium
Missing data treatment for locally weighted partial least square‐based modelling: A comparative study
Author(s) -
Yeo Wan Sieng,
Saptoro Agus,
Kumar Perumal
Publication year - 2020
Publication title -
asia‐pacific journal of chemical engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.348
H-Index - 35
eISSN - 1932-2143
pISSN - 1932-2135
DOI - 10.1002/apj.2422
Subject(s) - missing data , imputation (statistics) , partial least squares regression , singular value decomposition , data mining , computer science , regression , soft sensor , pattern recognition (psychology) , statistics , artificial intelligence , mathematics , process (computing) , machine learning , operating system
Adaptive soft sensors including widely used locally weighted partial least square (LW‐PLS) have been established for online prediction, fault detection, and process monitoring. Nevertheless, majority of these existing adaptive soft sensors have zero tolerance to missing data, and the presence of missing data is inevitable due to sensor failures, routine maintenance, changes in sensor equipment over time, merging data from different system, and so forth. In the literature, limited studies could be found on the effects of missing data and the existing missing data imputation methods on the predictive performances of adaptive soft sensors. This work reports combined use of different missing data imputation techniques on LW‐PLS and the nonadaptive soft sensor, the partial least square (PLS). Well known trimmed score regression (TSR) and singular value decomposition (SVD) were employed in this study, and thus, the newly integrated TSR‐LW‐PLS and SVD‐LW‐PLS algorithms were proposed. Meanwhile, both existing TSR‐PLS and SVD‐PLS were used, and their results compared with the novel TSR‐LW‐PLS and SVD‐LW‐PLS algorithms. These algorithms were tested and evaluated using two examples having different percentages of missing data ranging from 5% to 40%. Results showed that TSR‐LW‐PLS model was superior compared with SVD‐LW‐PLS, TSR‐PLS, and SVD‐PLS as indicated by more than 100% improvements in prediction accuracy. It was also evident that TSR‐LW‐PLS is able to cope well with up to 20% of missing data.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here