Variable selection should be blinded to the outcome
Author(s) -
Tamás Ferenci
Publication year - 2017
Publication title -
international journal of epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.406
H-Index - 208
eISSN - 1464-3685
pISSN - 0300-5771
DOI - 10.1093/ije/dyx048
Subject(s) - outcome (game theory) , blinded study , medicine , selection (genetic algorithm) , computer science , artificial intelligence , mathematics , mathematical economics
When multivariate models are used for confounder adjustment, all potential confounders should be included in the model, or if not, their selection should be blinded to the outcome. Other methods, such as “pre-filtering” variables based on their univariate association with the outcome may give rise to biased regression coefficients, biased standard errors, biased confidence intervals, misspecified test distributions and exaggerated p-values. If variable selection is needed (for instance due to the high number of potential confounders) data reduction methods that are blinded to the outcome, modern approaches such as Bayesian Model Averaging or the application of shrinkage, i.e. penalization (regularization) of the regression are preferable.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom