The evolution of partial least squares models and related chemometric approaches in metabonomics and metabolic phenotyping
Author(s) - Fonville Judith M., Richards Selena E., Barton Richard H., Boulange Claire L., Ebbels Timothy M. D., Nicholson Jeremy K., Holmes Elaine, Dumas Marc-Emmanuel
Publication year - 2010
Publication title - Journal of Chemometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.47
H-Index - 92
eISSN - 1099-128X
pISSN - 0886-9383
DOI - 10.1002/cem.1359
Subject(s) - partial least squares regression, computer science, context (archaeology), flexibility (engineering), chemometrics, data mining, visualization, principal component analysis, multivariate statistics, machine learning, artificial intelligence, mathematics, biology, statistics, paleontology
Metabonomics is a key element in systems biology, and with current analytical methods, generates vast amounts of quantitative or qualitative metabolic data. Understanding of the global function of the living organism can be achieved by integration of ‘omics’ approaches including metabonomics, genomics, transcriptomics and proteomics, increasing the complexity of the full data sets. Multivariate statistical approaches are well suited to extract the characterizing metabolic information associated with each level of dynamic process. In this review, we discuss techniques that have evolved from principal component analysis and partial least squares (PLS) methods with a focus on improved interpretation and modeling with respect to biomarker recovery and data visualization in the context of metabonomic applications. Visualization is of paramount importance to investigate complex metabolic signatures, the power and potential of which is illustrated with key papers. Recent improvements based on the removal of orthogonal variation are discussed in terms of interpretation enhancement, and are supported by relevant applications. Flexibility of PLS methods in general and of O‐PLS in particular allows implementation of derivative methods such as O2‐PLS, O‐PLS‐variance components, nonlinear methods, and batch modeling to improve analysis of complex data sets, which facilitates extraction of information related to subtle biological processes. These approaches can be used to address issues present in complex multi‐factorial data sets. Thus, we highlight the key advantages and limitations of the different latent variable applications for top‐down systems biology and assess the differences between the methods available. Copyright © 2010 John Wiley & Sons, Ltd.
