Principal component analysis for interval‐valued observations | Zendy

DouzalChouakria A. | Zendy; Billard L. | Zendy; Diday E. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Principal component analysis for interval‐valued observations

Author(s) -

DouzalChouakria A.,

Billard L.,

Diday E.

Publication year - 2011

Publication title -

statistical analysis and data mining: the asa data science journal

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.381

H-Index - 33

eISSN - 1932-1872

pISSN - 1932-1864

DOI - 10.1002/sam.10118

Subject(s) - principal component analysis , interval (graph theory) , hypercube , computer science , data mining , pattern recognition (psychology) , feature (linguistics) , algorithm , vertex (graph theory) , interval data , mathematics , artificial intelligence , theoretical computer science , graph , measure (data warehouse) , combinatorics , linguistics , philosophy , parallel computing

One feature of contemporary datasets is that instead of the single point value in the p ‐dimensional space ℜ p seen in classical data, the data may take interval values thus producing hypercubes in ℜ p . This paper studies the vertices principal components methodology for interval‐valued data; and provides enhancements to allow for so‐called ‘trivial’ intervals, and generalized weight functions. It also introduces the concept of vertex contributions to the underlying principal components, a concept not possible for classical data, but one which provides a visualization method that further aids in the interpretation of the methodology. The method is illustrated in a dataset using measurements of facial characteristics obtained from a study of face recognition patterns for surveillance purposes. A comparison with analyses in which classical surrogates replace the intervals, shows how the symbolic analysis gives more informative conclusions. A second example illustrates how the method can be applied even when the number of parameters exceeds the number of observations, as well as how uncertainty data can be accommodated. © 2011 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 4: 229–246 2011

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research