Microarray Data Analysis: From Hypotheses to Conclusions Using Gene Expression Data
Author(s) -
Nicola J. Armstrong,
Mark A. van de Wiel
Publication year - 2004
Publication title -
analytical cellular pathology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.576
H-Index - 24
eISSN - 2210-7185
pISSN - 2210-7177
DOI - 10.1155/2004/943940
Subject(s) - microarray analysis techniques , microarray databases , computational biology , expression (computer science) , gene expression , microarray , gene , computer science , gene expression profiling , bioinformatics , data mining , biology , genetics , programming language
We review several commonly used methods for the design and analysis of microarray data. To begin with, some experimental design issues are addressed. Several approaches for pre-processing the data (filtering and normalization) before the statistical analysis stage are then discussed. A common first step in this type of analysis is gene selection based on statistical testing. Two approaches, permutation and model-based methods are explained and we emphasize the need to correct for multiple testing. Moreover, powerful approaches based on gene sets are mentioned. Clustering of either genes or samples is frequently performed when analyzing microarray data. We summarize the basics of both supervised and unsupervised clustering (classification). The latter may be of use for creating diagnostic arrays, for example. Construction of biological networks, such as pathways, is a statistically challenging but complex task that is a relatively new development and hence mentioned only briefly. We finish with some remarks on literature and software. The emphasis in this paper is on the philosophy behind several statistical issues and on a critical interpretation of microarray related analysis methods.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom