Robust principal component analysis of electromagnetic arrays with missing data
Author(s) - Smirnov M. Yu., Egbert G. D.
Publication year - 2012
Publication title - Geophysical Journal International
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.302
H-Index - 168
eISSN - 1365-246X
pISSN - 0956-540X
DOI - 10.1111/j.1365-246x.2012.05569.x
Subject(s) - principal component analysis, multivariate statistics, robust principal component analysis, algorithm, computer science, missing data, robustness (evolution), data mining, pattern recognition (psychology), sort, robust regression, context (archaeology), artificial intelligence, regression analysis, machine learning, paleontology, biochemistry, chemistry, information retrieval, gene, biology
SUMMARY We describe a new algorithm for robust principal component analysis (PCA) of electromagnetic (EM) array data, extending previously developed multivariate methods to arrays with large data gaps and only partial overlap between site occupations. Our approach is based on a criss‐cross regression scheme in which polarization parameters and spatial modes are alternately estimated with robust regression procedures. The basic scheme can be viewed as an expectation robust (ER) algorithm, of the sort that has been widely discussed in the statistical literature in the context of robust PCA, but with details tailored to the physical specifics of EM array observations. We have tested the algorithm with synthetic and real data, including data denial experiments in which we created artificial gaps and compared results obtained from full and incomplete data arrays. These tests show that for modest amounts of missing data (up to about 20 per cent) the algorithm performs well, reproducing essentially the same dominant spatial modes that would be obtained from analysis of the complete array. The algorithm thus makes multivariate analysis practical for the first time for large heterogeneous arrays, as we illustrate by application to two different EM arrays.
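
The criss-cross idea in the summary can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes real-valued data (EM array processing normally works with complex Fourier coefficients), a plain Huber weight function, and a single global residual scale, and it omits every EM-specific detail the paper describes. The names `criss_cross_pca`, `huber_weights` and `weighted_ls` are hypothetical. Rows of `X` play the role of data channels, columns the role of time segments, and `NaN` marks a gap.

```python
import numpy as np

def huber_weights(r, scale, c=1.5):
    """Huber weights: 1 for small residuals, c*scale/|r| for large ones."""
    a = np.abs(r) / max(scale, 1e-12)
    return np.where(a <= c, 1.0, c / np.maximum(a, 1e-12))

def weighted_ls(A, b, w):
    """Solve min_x sum_i w_i * (b_i - A_i x)^2 by rescaling rows."""
    sw = np.sqrt(w)
    x, *_ = np.linalg.lstsq(A * sw[:, None], b * sw, rcond=None)
    return x

def criss_cross_pca(X, k, n_iter=50, c=1.5):
    """Rank-k fit X ~ U @ V.T using only observed (non-NaN) entries.

    U (n_channels, k) holds the spatial modes; V (n_times, k) holds the
    per-segment coefficients (the analogue of polarization parameters).
    """
    mask = ~np.isnan(X)
    X0 = np.where(mask, X, 0.0)
    # Assumption: start from the SVD of the zero-filled array.
    U = np.linalg.svd(X0, full_matrices=False)[0][:, :k].copy()
    V = np.zeros((X.shape[1], k))
    W = mask.astype(float)  # robust weights, zero at gaps
    for _ in range(n_iter):
        # Step 1: for each time segment, regress observed channels on U.
        for t in range(X.shape[1]):
            m = mask[:, t]
            if m.sum() >= k:
                V[t] = weighted_ls(U[m], X[m, t], W[m, t])
        # Step 2: for each channel, regress observed segments on V.
        for i in range(X.shape[0]):
            m = mask[i]
            if m.sum() >= k:
                U[i] = weighted_ls(V[m], X[i, m], W[i, m])
        # Step 3: update the robust weights from the current residuals.
        R = np.where(mask, X - U @ V.T, 0.0)
        scale = 1.4826 * np.median(np.abs(R[mask]))
        W = np.where(mask, huber_weights(R, scale, c), 0.0)
    return U, V
```

A data denial experiment of the kind the summary mentions can be mimicked by setting a random fraction of a synthetic low-rank array to `NaN`, running `criss_cross_pca`, and comparing `U @ V.T` with the known values at the withheld entries.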
