Subsystem Identification Through Dimensionality Reduction of Large-Scale Gene Expression Data | Zendy

Philip M. Kim | Zendy; Bruce Tidor | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Subsystem Identification Through Dimensionality Reduction of Large-Scale Gene Expression Data

Author(s) -

Philip M. Kim,

Bruce Tidor

Publication year - 2003

Publication title -

genome research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.556

H-Index - 297

eISSN - 1549-5469

pISSN - 1088-9051

DOI - 10.1101/gr.903503

Subject(s) - non negative matrix factorization , dimensionality reduction , computational biology , biology , similarity (geometry) , identification (biology) , computer science , curse of dimensionality , expression (computer science) , data mining , data set , set (abstract data type) , exploit , matrix decomposition , basis (linear algebra) , scale (ratio) , pattern recognition (psychology) , artificial intelligence , mathematics , image (mathematics) , eigenvalues and eigenvectors , physics , botany , computer security , geometry , quantum mechanics , programming language

The availability of parallel, high-throughput biological experiments that simultaneously monitor thousands of cellular observables provides an opportunity for investigating cellular behavior in a highly quantitative manner at multiple levels of resolution. One challenge to more fully exploit new experimental advances is the need to develop algorithms to provide an analysis at each of the relevant levels of detail. Here, the data analysis method non-negative matrix factorization (NMF) has been applied to the analysis of gene array experiments. Whereas current algorithms identify relationships on the basis of large-scale similarity between expression patterns, NMF is a recently developed machine learning technique capable of recognizing similarity between subportions of the data corresponding to localized features in expression space. A large data set consisting of 300 genome-wide expression measurements of yeast was used as sample data to illustrate the performance of the new approach. Local features detected are shown to map well to functional cellular subsystems. Functional relationships predicted by the new analysis are compared with those predicted using standard approaches; validation using bioinformatic databases suggests predictions using the new approach may be up to twice as accurate as some conventional approaches.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research