Predicting survival from microarray data—a comparative study
Author(s) -
Hege Bøvelstad,
Ståle Nygård,
Hege Leite Størvold,
Magne Aldrin,
Ørnulf Borgan,
Arnoldo Frigessi,
Ole Christian Lingjærde
Publication year - 2007
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btm305
Subject(s) - lasso (programming language) , univariate , principal component regression , dimensionality reduction , regression analysis , computer science , regression , statistics , feature selection , linear regression , principal component analysis , proportional hazards model , elastic net regularization , data mining , mathematics , artificial intelligence , multivariate statistics , world wide web
Survival prediction from gene expression data and other high-dimensional genomic data has been subject to much research during the last years. These kinds of data are associated with the methodological problem of having many more gene expression values than individuals. In addition, the responses are censored survival times. Most of the proposed methods handle this by using Cox's proportional hazards model and obtain parameter estimates by some dimension reduction or parameter shrinkage estimation technique. Using three well-known microarray gene expression data sets, we compare the prediction performance of seven such methods: univariate selection, forward stepwise selection, principal components regression (PCR), supervised principal components regression, partial least squares regression (PLS), ridge regression and the lasso.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom