Missing data treatment for locally weighted partial least square‐based modelling: A comparative study | Zendy

Yeo Wan Sieng | Zendy; Saptoro Agus | Zendy; Kumar Perumal | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Missing data treatment for locally weighted partial least square‐based modelling: A comparative study

Author(s) -

Yeo Wan Sieng,

Saptoro Agus,

Kumar Perumal

Publication year - 2020

Publication title -

asia‐pacific journal of chemical engineering

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.348

H-Index - 35

eISSN - 1932-2143

pISSN - 1932-2135

DOI - 10.1002/apj.2422

Subject(s) - missing data , imputation (statistics) , partial least squares regression , singular value decomposition , data mining , computer science , regression , soft sensor , pattern recognition (psychology) , statistics , artificial intelligence , mathematics , process (computing) , machine learning , operating system

Adaptive soft sensors including widely used locally weighted partial least square (LW‐PLS) have been established for online prediction, fault detection, and process monitoring. Nevertheless, majority of these existing adaptive soft sensors have zero tolerance to missing data, and the presence of missing data is inevitable due to sensor failures, routine maintenance, changes in sensor equipment over time, merging data from different system, and so forth. In the literature, limited studies could be found on the effects of missing data and the existing missing data imputation methods on the predictive performances of adaptive soft sensors. This work reports combined use of different missing data imputation techniques on LW‐PLS and the nonadaptive soft sensor, the partial least square (PLS). Well known trimmed score regression (TSR) and singular value decomposition (SVD) were employed in this study, and thus, the newly integrated TSR‐LW‐PLS and SVD‐LW‐PLS algorithms were proposed. Meanwhile, both existing TSR‐PLS and SVD‐PLS were used, and their results compared with the novel TSR‐LW‐PLS and SVD‐LW‐PLS algorithms. These algorithms were tested and evaluated using two examples having different percentages of missing data ranging from 5% to 40%. Results showed that TSR‐LW‐PLS model was superior compared with SVD‐LW‐PLS, TSR‐PLS, and SVD‐PLS as indicated by more than 100% improvements in prediction accuracy. It was also evident that TSR‐LW‐PLS is able to cope well with up to 20% of missing data.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research