z-logo
Premium
On the effectiveness of cross‐fitting in multi‐block PLS (CF‐MBPLS)
Author(s) -
Kohonen Jarno,
Alatalo Hannu,
Reinikainen SatuPia
Publication year - 2012
Publication title -
journal of chemometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.47
H-Index - 92
eISSN - 1099-128X
pISSN - 0886-9383
DOI - 10.1002/cem.1413
Subject(s) - overfitting , partial least squares regression , block (permutation group theory) , outlier , computer science , artificial intelligence , latent variable , pattern recognition (psychology) , projection (relational algebra) , machine learning , data mining , mathematics , algorithm , artificial neural network , geometry
Multi‐block PLS is an extension of partial least squares or projection to latent structures (PLS), where the descriptor matrix is divided into meaningful blocks based on either process units or type of data. A typical application is using process variables as one block and spectral data on another block. It has been utilized in obtaining more information of processes and the effect of different types of variables. In comparison with priority or hierarchical PLS, in multi‐block PLS, there is no need to prioritize blocks in advance because they are iteratively calculated at the same time. With multi‐block PLS, however, it is easy to overfit data resulting in a poor predictive ability. A recent development called cross‐fitting has been reported to alleviate the problem of overfitting in PLS. This approach was adjusted to multi‐block PLS and is tested on two different data sets, where overfitting and sensitivity to outliers are issues. Copyright © 2012 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here