Premium
ESTIMATING MULTIPLE REGRESSIONS IN m GROUPS: A CROSS‐VALIDATION STUDY
Author(s) -
Novick Melvin R.,
Jackson Paul H.,
Thayer Dorothy T.,
Cole Nancy S.
Publication year - 1972
Publication title -
british journal of mathematical and statistical psychology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.157
H-Index - 51
eISSN - 2044-8317
pISSN - 0007-1102
DOI - 10.1111/j.2044-8317.1972.tb00476.x
Subject(s) - statistics , mathematics , bayesian probability , sample size determination , regression , sample (material) , mean squared error , least squares function approximation , generalized least squares , regression analysis , cross validation , chemistry , chromatography , estimator
An improved Bayesian method, due to Lindley, for the simultaneous estimation of multiple regressions in m groups is studied by applying the method to test data. Evidence is found to support the belief that in many testing applications the collateral information obtained from each subset of m − 1 colleges will be useful for estimating the regression in the m th college, especially when the sample sizes are small. Using a 25 per cent sample, the Bayesian prediction equations achieved an average 9.7 per cent reduction in mean square error, as compared with the within‐group least squares equations, when cross‐validated with a later sample. More importantly, the mean square error for the Bayesian equations based on the 25 per cent sample was only barely greater than that for the least squares equations based on the full sample data. Thus the main virtue of the method is that it permits predictions to be made separately for relevant subpopulations (e.g. male‐female) where sample sizes would otherwise be too small to achieve an acceptable degree of accuracy.