Premium
Informing a risk prediction model for binary outcomes with external coefficient information
Author(s) -
Cheng Wenting,
Taylor Jeremy M. G.,
Gu Tian,
Tomlins Scott A.,
Mukherjee Bhramar
Publication year - 2019
Publication title -
journal of the royal statistical society: series c (applied statistics)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.205
H-Index - 72
eISSN - 1467-9876
pISSN - 0035-9254
DOI - 10.1111/rssc.12306
Subject(s) - logistic regression , statistics , regression analysis , calculator , computer science , regression , mathematics , linear regression , binary number , set (abstract data type) , data mining , arithmetic , programming language , operating system
Summary We consider a situation where rich historical data are available for the coefficients and their standard errors in an established regression model describing the association between a binary outcome variable Y and a set of predicting factors X , from a large study. We would like to utilize this summary information for improving estimation and prediction in an expanded model of interest, Y | X , B . The additional variable B is a new biomarker, measured on a small number of subjects in a new data set. We develop and evaluate several approaches for translating the external information into constraints on regression coefficients in a logistic regression model of Y | X , B . Borrowing from the measurement error literature we establish an approximate relationship between the regression coefficients in the models Pr( Y =1| X , β ), Pr( Y =1| X , B , γ ) and E ( B | X , θ ) for a Gaussian distribution of B . For binary B we propose an alternative expression. The simulation results comparing these methods indicate that historical information on Pr( Y =1| X , β ) can improve the efficiency of estimation and enhance the predictive power in the regression model of interest Pr( Y =1| X , B , γ ). We illustrate our methodology by enhancing the high grade prostate cancer prevention trial risk calculator, with two new biomarkers: prostate cancer antigen 3 and TMPRSS2:ERG.