Identification of Common Prognostic Gene Expression Signatures with Biological Meanings from Microarray Gene Expression Datasets
Author(s) -
Jun Yao,
Qi Zhao,
Ying Yuan,
Li Zhang,
Xiaoming Liu,
W. K. Alfred Yung,
John N. Weinstein
Publication year - 2012
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0045894
Subject(s) - pten , biology , gene expression profiling , microarray analysis techniques , dna microarray , computational biology , gene signature , gene , breast cancer , proportional hazards model , microarray , survival analysis , overfitting , gene expression , bioinformatics , cancer , genetics , medicine , computer science , pi3k/akt/mtor pathway , machine learning , apoptosis , artificial neural network
Numerous prognostic gene expression signatures for breast cancer were generated previously with few overlap and limited insight into the biology of the disease. Here we introduce a novel algorithm named SCoR ( S urvival analysis using Co x proportional hazard regression and R andom resampling) to apply random resampling and clustering methods in identifying gene features correlated with time to event data. This is shown to reduce overfitting noises involved in microarray data analysis and discover functional gene sets linked to patient survival. SCoR independently identified a common poor prognostic signature composed of cell proliferation genes from six out of eight breast cancer datasets. Furthermore, a sequential SCoR analysis on highly proliferative breast cancers repeatedly identified T/B cell markers as favorable prognosis factors. In glioblastoma, SCoR identified a common good prognostic signature of chromosome 10 genes from two gene expression datasets (TCGA and REMBRANDT), recapitulating the fact that loss of one copy of chromosome 10 (which harbors the tumor suppressor PTEN) is linked to poor survival in glioblastoma patients. SCoR also identified prognostic genes on sex chromosomes in lung adenocarcinomas, suggesting patient gender might be used to predict outcome in this disease. These results demonstrate the power of SCoR to identify common and biologically meaningful prognostic gene expression signatures.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom