Estimation of Subspace Arrangements with Applications in Modeling and Segmenting Mixed Data
Author(s) -
Yi Ma,
Allen Y. Yang,
Harm Derksen,
Robert M. Fossum
Publication year - 2008
Publication title -
siam review
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 4.683
H-Index - 120
eISSN - 1095-7200
pISSN - 0036-1445
DOI - 10.1137/060655523
Subject(s) - subspace topology , linear subspace , computer science , outlier , data mining , identification (biology) , inference , data point , cluster analysis , theoretical computer science , algorithm , artificial intelligence , mathematics , botany , geometry , biology
Recently many scientific and engineering applications have involved the challenging task of analyzing large amounts of unsorted high-dimensional data that have very complicated structures. From both geometric and statistical points of view, such unsorted data are considered mixed as different parts of the data have significantly different structures which cannot be described by a single model. In this paper we propose to use subspace arrangements—a union of multiple subspaces—for modeling mixed data: each subspace in the arrangement is used to model just a homogeneous subset of the data. Thus, multiple subspaces together can capture the heterogeneous structures within the data set. In this paper, we give a comprehensive introduction to a new approach for the estimation of subspace arrangements. This is known as generalized principal component analysis (GPCA). In particular, we provide a comprehensive summary of important algebraic properties and statistical facts that are crucial for making the inference of subspace arrangements both efficient and robust, even when the given data are corrupted by noise or contaminated with outliers. This new method in many ways improves and generalizes extant methods for modeling or clustering mixed data. There have been successful applications of this new method to many real-world problems in computer vision, image processing, and system identification. In this paper, we will examine several of those representative applications. This paper is intended to be expository in nature. However, in order that this may serve as a more complete reference for both theoreticians and practitioners, we take the liberty of filling in several gaps between the theory and the practice in the existing literature.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom