The Use of Resampling Methods to Simplify Regression Models in Medical Statistics
Author(s) - Willi Sauerbrei
Publication year - 1999
Publication title - Journal of the Royal Statistical Society: Series C (Applied Statistics)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.205
H-Index - 72
eISSN - 1467-9876
pISSN - 0035-9254
DOI - 10.1111/1467-9876.00155
Subject(s) - resampling, logistic regression, statistics, computer science, regression, regression analysis, selection (genetic algorithm), stepwise regression, stability (learning theory), model selection, linear regression, econometrics, mathematics, machine learning
The number of variables in a regression model is often too large and a more parsimonious model may be preferred. Selection strategies (e.g. all‐subset selection with various penalties for model complexity, or stepwise procedures) are widely used, but there are few analytical results about their properties. The problems of replication stability, model complexity, selection bias and an over‐optimistic estimate of the predictive value of a model are discussed together with several proposals based on resampling methods. The methods are applied to data from a case–control study on atopic dermatitis and a clinical trial to compare two chemotherapy regimes by using a logistic regression and a Cox model. A recent proposal to use shrinkage factors to reduce the bias of parameter estimates caused by model building is extended to parameterwise shrinkage factors and is discussed as a further possibility to illustrate problems of models which are too complex. The results from the resampling approaches favour greater simplicity of the final regression model.
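
One common way to probe the replication stability mentioned in the abstract is to repeat the variable selection in bootstrap samples and record how often each candidate variable is chosen. The sketch below illustrates that general idea only and should not be read as the paper's procedure. The function names (`toy_select`, `inclusion_frequencies`), the synthetic data, and the correlation threshold are all assumptions made for illustration, and the simple correlation screen stands in for a real stepwise or all-subset selection on a logistic or Cox model.

```python
import random
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def toy_select(columns, y, threshold=0.3):
    """Hypothetical stand-in selector: keep variables whose absolute
    correlation with the outcome exceeds the threshold."""
    return {name for name, x in columns.items() if abs(pearson(x, y)) > threshold}


def inclusion_frequencies(columns, y, select, n_boot=200, seed=0):
    """Fraction of bootstrap samples in which each variable is selected."""
    rng = random.Random(seed)
    n = len(y)
    counts = {name: 0 for name in columns}
    for _ in range(n_boot):
        # Draw n observations with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        boot_cols = {name: [x[i] for i in idx] for name, x in columns.items()}
        boot_y = [y[i] for i in idx]
        # Re-run the whole selection on the bootstrap sample.
        for name in select(boot_cols, boot_y):
            counts[name] += 1
    return {name: c / n_boot for name, c in counts.items()}


# Synthetic data (assumed): x1 drives the outcome, x2 is pure noise.
data_rng = random.Random(1)
n_obs = 100
x1 = [data_rng.gauss(0, 1) for _ in range(n_obs)]
x2 = [data_rng.gauss(0, 1) for _ in range(n_obs)]
y = [2.0 * a + data_rng.gauss(0, 1) for a in x1]

freqs = inclusion_frequencies({"x1": x1, "x2": x2}, y, toy_select)
print(freqs)
```

A variable that enters the model in only a small share of bootstrap samples is a candidate for removal, which fits the abstract's conclusion that resampling favours simpler final models. The cut-off for "small" is a judgment call that this sketch does not settle.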
