z-logo
open-access-imgOpen Access
Uncovering interpretable relationships in high-dimensional scientific data through function preserving projections
Author(s) -
Shusen Liu,
Rushil Anirudh,
Jayaraman J. Thiagarajan,
Peer-Timo Bremer
Publication year - 2020
Publication title -
machine learning: science and technology
Language(s) - English
Resource type - Journals
ISSN - 2632-2153
DOI - 10.1088/2632-2153/abab60
Subject(s) - dimensionality reduction , intuition , computer science , embedding , projection (relational algebra) , high dimensional , function (biology) , nonlinear dimensionality reduction , data mining , artificial intelligence , algorithm , cognitive science , psychology , evolutionary biology , biology
In many fields of science and engineering, we frequently encounter experiments or simulations datasets that describe the behavior of complex systems and uncovering human interpretable patterns between their inputs and outputs via exploratory data analysis is essential for building intuition and facilitating discovery. Often, we resort to 2D embeddings for examining these high-dimensional relationships (e.g. dimensionality reduction). However, most existing embedding methods treat the dimensions as coordinates for samples in a high-dimensional space, which fail to capture the potential functional relationships, and the few methods that do take function into consideration either only focus on linear patterns or produce non-linear embeddings that are hard to interpret. To address these challenges, we proposed function preserving projections (FPP), which construct 2D linear embeddings optimized to reveal interpretable yet potentially non-linear patterns between the domain and the range of a high-dimensional function. The intuition here is that humans are good at understanding potentially non-linear patterns in 2D but unable to interpret non-linear mapping from high-dimensional space to 2D. Therefore, we should restrict the projection to linear but not the pattern we are seeking. Using FPP on real-world datasets, one can obtain fundamentally new insights about high-dimensional relationships in extremely large datasets that could not be processed with existing dimension reduction methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here