Premium
Nonlinear Laplacian spectral analysis: capturing intermittent and low‐frequency spatiotemporal patterns in high‐dimensional data
Author(s) -
Giannakis Dimitrios,
Majda Andrew J.
Publication year - 2013
Publication title -
statistical analysis and data mining: the asa data science journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.381
H-Index - 33
eISSN - 1932-1872
pISSN - 1932-1864
DOI - 10.1002/sam.11171
Subject(s) - nonlinear system , eigenfunction , singular spectrum analysis , mathematics , orthonormal basis , diffusion map , nonlinear dimensionality reduction , computer science , algorithm , eigenvalues and eigenvectors , singular value decomposition , artificial intelligence , dimensionality reduction , physics , quantum mechanics
We present a technique for spatiotemporal data analysis called nonlinear Laplacian spectral analysis (NLSA), which generalizes singular spectrum analysis (SSA) to take into account the nonlinear manifold structure of complex datasets. The key principle underlying NLSA is that the functions used to represent temporal patterns should exhibit a degree of smoothness on the nonlinear data manifold M ; a constraint absent from classical SSA. NLSA enforces such a notion of smoothness by requiring that temporal patterns belong in low‐dimensional Hilbert spaces V l spanned by the leading l Laplace‐‐Beltrami eigenfunctions on M . These eigenfunctions can be evaluated efficiently in high ambient‐space dimensions using sparse graph‐theoretic algorithms. Moreover, they provide orthonormal bases to expand a family of linear maps, whose singular value decomposition leads to sets of spatiotemporal patterns at progressively finer resolution on the data manifold. The Riemannian measure of M and an adaptive graph kernel width enhances the capability of NLSA to detect important nonlinear processes, including intermittency and rare events. The minimum dimension of V l required to capture these features while avoiding overfitting is estimated here using spectral entropy criteria. As an application, we study the upper‐ocean temperature in the North Pacific sector of a 700‐year control run of the CCSM3 climate model. Besides the familiar annual and decadal modes, NLSA recovers a family of intermittent processes associated with the Kuroshio current and the subtropical and subpolar gyres. These processes carry little variance (and are therefore not captured by SSA), yet their dynamical role is expected to be significant. The intermittent modes are eliminated from the spectrum if seasonal detrending is performed. © 2012 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 6: 180–194, 2013