Computational performance and cross‐validation error precision of five PLS algorithms using designed and real data sets | Zendy

Martins João Paulo A. | Zendy; Teófilo Reinaldo F. | Zendy; Ferreira Márcia M. C. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Computational performance and cross‐validation error precision of five PLS algorithms using designed and real data sets

Author(s) -

Martins João Paulo A.,

Teófilo Reinaldo F.,

Ferreira Márcia M. C.

Publication year - 2010

Publication title -

journal of chemometrics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.47

H-Index - 92

eISSN - 1099-128X

pISSN - 0886-9383

DOI - 10.1002/cem.1309

Subject(s) - cross validation , latent variable , algorithm , partial least squares regression , kernel (algebra) , computer science , variable (mathematics) , statistics , data mining , mathematics , artificial intelligence , machine learning , mathematical analysis , combinatorics

An evaluation of computational performance and precision regarding the cross‐validation error of five partial least squares (PLS) algorithms (NIPALS, modified NIPALS, Kernel, SIMPLS and bidiagonal PLS), available and widely used in the literature, is presented. When dealing with large data sets, computational time is an important issue, mainly in cross‐validation and variable selection. In the present paper, the PLS algorithms are compared in terms of the run time and the relative error in the precision obtained when performing leave‐one‐out cross‐validation using simulated and real data sets. The simulated data sets were investigated through factorial and Latin square experimental designs. The evaluations were based on the number of rows, the number of columns and the number of latent variables. With respect to their performance, the results for both simulated and real data sets have shown that the differences in run time are statistically different. PLS bidiagonal is the fastest algorithm, followed by Kernel and SIMPLS. Regarding cross‐validation error, all algorithms showed similar results. However, in some situations as, for example, when many latent variables were in question, discrepancies were observed, especially with respect to SIMPLS. Copyright © 2010 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore