Stepwise Model Fitting and Statistical Inference: Turning Noise into Signal Pollution
Author(s) -
Roger Mundry,
Charles L. Nunn
Publication year - 2008
Publication title -
the american naturalist
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.954
H-Index - 205
eISSN - 1537-5323
pISSN - 0003-0147
DOI - 10.1086/593303
Subject(s) - stepwise regression , statistical inference , inference , null hypothesis , statistics , statistical hypothesis testing , computer science , regression , statistical model , regression analysis , model selection , variables , type i and type ii errors , econometrics , artificial intelligence , machine learning , mathematics
Statistical inference based on stepwise model selection is applied regularly in ecological, evolutionary, and behavioral research. In addition to fundamental shortcomings with regard to finding the "best" model, stepwise procedures are known to suffer from a multiple-testing problem, yet the method is still widely used. As an illustration of this problem, we present results of a simulation study of artificial data sets of uncorrelated variables, with two to 10 predictor variables and one dependent variable. We then compared results from stepwise regression with a regression model in which all predictor variables were entered simultaneously. These analyses clearly demonstrate that significance tests based on stepwise procedures lead to greatly inflated Type I error rates (i.e., the probability of erroneously rejecting a true null hypothesis). By using a simple simulation design, our study amplifies previous warnings about using stepwise procedures, and we follow others in recommending that biologists refrain from applying these methods.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom