Premium
A mathematical programming approach for improving the robustness of least sum of absolute deviations regression
Author(s) -
Giloni Avi,
Sengupta Bhaskar,
Simonoff Jeffrey S.
Publication year - 2006
Publication title -
naval research logistics (nrl)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.665
H-Index - 68
eISSN - 1520-6750
pISSN - 0894-069X
DOI - 10.1002/nav.20139
Subject(s) - least absolute deviations , robustness (evolution) , knapsack problem , estimator , mathematical optimization , mathematics , linear programming , integer programming , robust optimization , robust regression , nonlinear programming , linear regression , regression analysis , regression , computer science , nonlinear system , statistics , biochemistry , chemistry , physics , quantum mechanics , gene
This paper discusses a novel application of mathematical programming techniques to a regression problem. While least squares regression techniques have been used for a long time, it is known that their robustness properties are not desirable. Specifically, the estimators are known to be too sensitive to data contamination. In this paper we examine regressions based on Least‐sum of Absolute Deviations (LAD) and show that the robustness of the estimator can be improved significantly through a judicious choice of weights. The problem of finding optimum weights is formulated as a nonlinear mixed integer program, which is too difficult to solve exactly in general. We demonstrate that our problem is equivalent to a mathematical program with a single functional constraint resembling the knapsack problem and then solve it for a special case. We then generalize this solution to general regression designs. Furthermore, we provide an efficient algorithm to solve the general nonlinear, mixed integer programming problem when the number of predictors is small. We show the efficacy of the weighted LAD estimator using numerical examples. © 2006 Wiley Periodicals, Inc. Naval Research Logistics, 2006