
Constructed Analogs and Linear Regression
Author(s) -
Michael K. Tippett,
Timothy DelSole
Publication year - 2013
Publication title -
monthly weather review
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.862
H-Index - 179
eISSN - 1520-0493
pISSN - 0027-0644
DOI - 10.1175/mwr-d-12-00223.1
Subject(s) - overfitting , linear regression , principal component regression , proper linear model , mathematics , regression , linear model , linear predictor function , regression diagnostic , statistics , regression analysis , principal component analysis , bayesian multivariate linear regression , econometrics , computer science , artificial intelligence , artificial neural network
The constructed analog procedure produces a statistical forecast that is a linear combination of past predictand values. The weights used to form the linear combination depend on the current predictor value and are chosen so that the linear combination of past predictor values approximates the current predictor value. The properties of the constructed analog method have previously been described as being distinct from those of linear regression. However, here the authors show that standard implementations of the constructed analog method give forecasts that are identical to linear regression forecasts. A consequence of this equivalence is that constructed analog forecasts based on many predictors tend to suffer from overfitting just as in linear regression. Differences between linear regression and constructed analog forecasts only result from implementation choices, especially ones related to the preparation and truncation of data. Two particular constructed analog implementations are shown to correspond to principal component regression and ridge regression. The equality of linear regression and constructed analog forecasts is illustrated in a Niño-3.4 prediction example, which also shows that increasing the number of predictors results in low-skill, high-variance forecasts, even at long leads, behavior typical of overfitting. Alternative definitions of the analog weights lead naturally to nonlinear extensions of linear regression such as local linear regression.