Premium
On the efficacy of the rank transformation in stepwise logistic and discriminant analysis
Author(s) -
O'gorman Thomas W.,
Woolson Robert F.
Publication year - 1993
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.4780120206
Subject(s) - linear discriminant analysis , logistic regression , statistics , rank (graph theory) , ranking (information retrieval) , mathematics , stepwise regression , discriminant , feature selection , optimal discriminant analysis , selection (genetic algorithm) , computer science , artificial intelligence , combinatorics
Abstract We have evaluated the performance of four stepwise variable selection procedures commonly used in medical and epidemiologic research. The four procedures are discriminant and logistic regression and their rank transformed versions, where the independent variables are replaced by their ranks. We generated, by computer, data for two groups from several distributions with a variety of sample sizes and covariance matrices. The two ranking procedures each increased the chance of correctly selecting those variables related to group membership for data generated from log‐normal or contaminated distributions. For normally distributed data the ranking procedure had little effect on variable selection. Rank transformed discriminant analysis and rank transformed logistic regression were equally effective in selecting variables when sample sizes exceeded 100. Rank transformed discriminant analysis was superior for smaller data sets. We discuss the implications of the results of this study for clinical and epidemiologic research.