z-logo
open-access-imgOpen Access
Small sample issues for microarray‐based classification
Author(s) -
Dougherty Edward R.
Publication year - 2001
Publication title -
comparative and functional genomics
Language(s) - English
Resource type - Journals
eISSN - 1532-6268
pISSN - 1531-6912
DOI - 10.1002/cfg.62
Subject(s) - classifier (uml) , computer science , feature selection , sample size determination , data mining , microarray analysis techniques , artificial intelligence , dna microarray , pattern recognition (psychology) , machine learning , statistics , mathematics , biology , gene , gene expression , biochemistry
In order to study the molecular biological differences between normal and diseased tissues, it is desirable to perform classification among diseases and stages of disease using microarray‐based gene‐expression values. Owing to the limited number of microarrays typically used in these studies, serious issues arise with respect to the design, performance and analysis of classifiers based on microarray data. This paper reviews some fundamental issues facing small‐sample classification: classification rules, constrained classifiers, error estimation and feature selection. It discusses both unconstrained and constrained classifier design from sample data, and the contributions to classifier error from constrained optimization and lack of optimality owing to design from sample data. The difficulty with estimating classifier error when confined to small samples is addressed, particularly estimating the error from training data. The impact of small samples on the ability to include more than a few variables as classifier features is explained. Copyright © 2001 John Wiley & Sons, Ltd.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here