Predicting future responses based on possibly mis-specified working models
Author(s) -
Tommaso Cai,
Lü Tian,
Scott D. Solomon,
L. J. Wei
Publication year - 2007
Publication title -
biometrika
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.307
H-Index - 122
eISSN - 1464-3510
pISSN - 0006-3444
DOI - 10.1093/biomet/asm078
Subject(s) - covariate , mathematics , set (abstract data type) , binary number , outcome (game theory) , sample (material) , statistics , regression , sample size determination , econometrics , computer science , chemistry , arithmetic , mathematical economics , chromatography , programming language
Under a general regression setting, we propose an optimal unconditional prediction procedure for future responses. The resulting prediction intervals or regions have a desirable average coverage level over a set of covariate vectors of interest. When the working model is not correctly specified, the traditional conditional prediction method is generally invalid. On the other hand, one can empirically calibrate the above unconditional procedure and also obtain its crossvalidated counterpart. Various large and small sample properties of these unconditional methods are examined analytically and numerically. We find that the 𝒦-fold crossvalidated procedure performs exceptionally well even for cases with rather small sample sizes. The new proposals are illustrated with two real examples, one with a continuous response and the other with a binary outcome. Copyright 2008, Oxford University Press.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom