z-logo
open-access-imgOpen Access
Gene selection criterion for discriminant microarray data analysis based on extreme value distributions
Author(s) -
Wentian Li,
Ivo Große
Publication year - 2003
Publication title -
citeseer x (the pennsylvania state university)
Language(s) - English
Resource type - Conference proceedings
ISBN - 1-58113-635-8
DOI - 10.1145/640075.640103
Subject(s) - linear discriminant analysis , computer science , data mining , selection (genetic algorithm) , microarray analysis techniques , extreme value theory , logistic regression , discriminant , gene selection , artificial intelligence , machine learning , pattern recognition (psychology) , statistics , mathematics , biology , gene , biochemistry , gene expression
An important issue commonly encountered in the analysis of microarray data is to decide which and how many genes should be selected for further studies. For discriminant microarray data analyses based on statistical models, such as the logistic regression model, this gene selection can be accomplished by a comparison of the maximum likelihood of the model given the real data, L(D|M), and the expected maximum likelihood of the model given an ensemble of surrogate data, L(D0|M). Typically, the computational burden for obtaining L(D0|M) is immense, often exceeding the limits of available resources by orders of magnitude. Here, we propose an approach that circumvents such heavy computations by mapping the simulation problem to an extreme value problem, which can be easily solved by numerical simulation. We choose three classification problems from two publicly available microarray datasets to illustrate that approach.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom