Assessment of maximum likelihood PCA missing data imputation | Zendy

FolchFortuny Abel | Zendy; Arteaga Francisco | Zendy; Ferrer Alberto | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Assessment of maximum likelihood PCA missing data imputation

Author(s) -

FolchFortuny Abel,

Arteaga Francisco,

Ferrer Alberto

Publication year - 2016

Publication title -

journal of chemometrics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.47

H-Index - 92

eISSN - 1099-128X

pISSN - 0886-9383

DOI - 10.1002/cem.2804

Subject(s) - imputation (statistics) , missing data , principal component analysis , principal component regression , partial least squares regression , statistics , regression , regression analysis , mathematics , computer science

Maximum likelihood principal component analysis (MLPCA) was originally proposed to incorporate measurement error variance information in principal component analysis (PCA) models. MLPCA can be used to fit PCA models in the presence of missing data, simply by assigning very large variances to the non‐measured values. An assessment of maximum likelihood missing data imputation is performed in this paper, analysing the algorithm of MLPCA and adapting several methods for PCA model building with missing data to its maximum likelihood version. In this way, known data regression (KDR), KDR with principal component regression (PCR), KDR with partial least squares regression (PLS) and trimmed scores regression (TSR) methods are implemented within the MLPCA method to work as different imputation steps. Six data sets are analysed using several percentages of missing data, comparing the performance of the original algorithm, and its adapted regression‐based methods, with other state‐of‐the‐art methods. Copyright © 2016 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research