Applying stability selection to consistently estimate sparse principal components in high-dimensional molecular data

Martin Sill, Maral Saadati, Axel Benner


Open Access



Publication year - 2015

Publication title - Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btv197

Subject(s) - principal component analysis, selection (genetic algorithm), stability (learning theory), computer science, principal (computer security), algorithm, statistics, data mining, mathematics, pattern recognition (psychology), artificial intelligence, machine learning, operating system

Principal component analysis (PCA) is a basic tool widely used in bioinformatics for visualization and dimension reduction. However, PCA is known to be at risk of estimating the true direction of maximal variability inconsistently in high-dimensional, low-sample-size settings, which are typical of molecular data. If the underlying signal is assumed to be sparse, i.e. only a fraction of the features contribute to a principal component (PC), estimation consistency can be retained. Most existing sparse PCA methods use L1-penalization, i.e. the lasso, to perform feature selection. However, the lasso is known to lack variable selection consistency in high dimensions, so interpreting the selected features can give misleading results.
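To make the idea of L1-penalized sparse PCA concrete, here is a minimal sketch of one common formulation: a rank-1 approximation of the centered data matrix in which the loading vector is soft-thresholded at every iteration. It is an illustration under stated assumptions, not necessarily the method used in the paper. The function names (`soft_threshold`, `sparse_pc1`, `simulate_sparse_data`), the penalty value `lam` and the simulated data are all hypothetical choices made for this example.

```python
import numpy as np

def soft_threshold(z, lam):
    """Lasso (L1) shrinkage: move every entry towards zero by lam."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

def sparse_pc1(X, lam, n_iter=200, tol=1e-8):
    """First sparse loading vector via alternating soft-thresholded updates.

    Illustrative sketch only: lam is a hypothetical tuning parameter and
    no claim is made that this matches any specific published algorithm.
    """
    Xc = X - X.mean(axis=0)                  # center each feature
    U, _, _ = np.linalg.svd(Xc, full_matrices=False)
    u = U[:, 0]                              # start from the ordinary PCA scores
    v = np.zeros(Xc.shape[1])
    for _ in range(n_iter):
        v_new = soft_threshold(Xc.T @ u, lam)   # L1-penalized loading update
        if not np.any(v_new):
            return v_new                     # penalty removed every feature
        u = Xc @ v_new
        u /= np.linalg.norm(u)               # unit-norm score vector
        converged = np.linalg.norm(v_new - v) < tol
        v = v_new
        if converged:
            break
    return v / np.linalg.norm(v)             # unit-norm sparse loading vector

def simulate_sparse_data(n=50, p=500, k=10, strength=10.0, seed=0):
    """Hypothetical toy data: only the first k of p features carry signal."""
    rng = np.random.default_rng(seed)
    loading = np.zeros(p)
    loading[:k] = 1.0 / np.sqrt(k)
    scores = rng.standard_normal(n)
    noise = rng.standard_normal((n, p))
    return strength * np.outer(scores, loading) + noise

X = simulate_sparse_data()
v_sparse = sparse_pc1(X, lam=8.0)
v_dense = np.linalg.svd(X - X.mean(axis=0), full_matrices=False)[2][0]
print("non-zero loadings, sparse PCA:", np.count_nonzero(v_sparse))
print("non-zero loadings, ordinary PCA:", np.count_nonzero(v_dense))
```

In this toy setting the penalty is set far above the noise level, so the selection is easy. With real molecular data the selected set depends strongly on the choice of `lam`, which is the kind of instability in lasso-based feature selection that the abstract warns about.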
