Learning to steer nonlinear interior-point methods
Author(s) -
Renke Kuhlmann
Publication year - 2019
Publication title -
euro journal on computational optimization
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.95
H-Index - 14
eISSN - 2192-4414
pISSN - 2192-4406
DOI - 10.1007/s13675-019-00118-4
Subject(s) - solver , nonlinear system , computer science , convergence (economics) , sequence (biology) , reinforcement learning , point (geometry) , imitation , set (abstract data type) , monotone polygon , interior point method , task (project management) , mathematical optimization , algorithm , artificial intelligence , mathematics , engineering , physics , geometry , systems engineering , quantum mechanics , biology , economics , genetics , programming language , economic growth , psychology , social psychology
Interior-point or barrier methods handle nonlinear programs by sequentially solving barrier subprograms with a decreasing sequence of barrier parameters. The specific barrier update rule strongly influences the theoretical convergence properties as well as the practical efficiency. While many global and local convergence analyses consider a monotone update that decreases the barrier parameter for every approximately solved subprogram, computational studies show a superior performance of more adaptive strategies. In this paper we interpret the adaptive barrier update as a reinforcement learning task. A deep Q-learning agent is trained by both imitation and random action selection. Numerical results based on an implementation within the nonlinear programming solver WORHP show that the agent successfully learns to steer the barrier parameter and additionally improves WORHP’s performance on the CUTEst test set.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom