Statistical Quasi-Newton: A New Look at Least Change
Author(s) -
Chuanhai Liu,
Scott Vander Wiel
Publication year - 2007
Publication title -
siam journal on optimization
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.066
H-Index - 136
eISSN - 1095-7189
pISSN - 1052-6234
DOI - 10.1137/040614700
Subject(s) - hessian matrix , broyden–fletcher–goldfarb–shanno algorithm , quasi newton method , mathematics , range (aeronautics) , eigenvalues and eigenvectors , wishart distribution , simple (philosophy) , mathematical optimization , newton's method , algorithm , computer science , statistics , nonlinear system , multivariate statistics , composite material , computer network , philosophy , physics , materials science , asynchronous communication , epistemology , quantum mechanics
A new method for quasi-Newton minimization outperforms BFGS by combining least-change updates of the Hessian with step sizes estimated from a Wishart model of uncertainty. The Hessian update is in the Broyden family but uses a negative parameter, outside the convex range, that is usually regarded as the safe zone for Broyden updates. Although full Newton steps based on this update tend to be too long, excellent performance is obtained with shorter steps estimated from the Wishart model. In numerical comparisons to BFGS the new statistical quasi-Newton (SQN) algorithm typically converges with about 25% fewer iterations, functions, and gradient evaluations on the top 1/3 hardest unconstrained problems in the CUTE library. Typical improvement on the 1/3 easiest problems is about 5%. The framework used to derive SQN provides a simple way to understand differences among various Broyden updates such as BFGS and DFP and shows that these methods do not preserve accuracy of the Hessian, in a certain sense, while the new method does. In fact, BFGS, DFP, and all other updates with nonnegative Broyden parameters tend to inflate Hessian estimates, and this accounts for their observed propensity to correct eigenvalues that are too small more readily than eigenvalues that are too large. Numerical results on three new test functions validate these conclusions.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom