Premium
Extremum seeking for optimal control problems with unknown time‐varying systems and unknown objective functions
Author(s) -
Scheinker Alexander,
Scheinker David
Publication year - 2021
Publication title -
international journal of adaptive control and signal processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.73
H-Index - 66
eISSN - 1099-1115
pISSN - 0890-6327
DOI - 10.1002/acs.3097
Subject(s) - control theory (sociology) , noise (video) , range (aeronautics) , controller (irrigation) , linear quadratic regulator , set (abstract data type) , computer science , function (biology) , quadratic equation , optimal control , mathematical optimization , mathematics , control (management) , engineering , artificial intelligence , agronomy , programming language , geometry , evolutionary biology , image (mathematics) , biology , aerospace engineering
Summary We consider the problem of optimal feedback control of an unknown, noisy, time‐varying, dynamic system that is initialized repeatedly. Examples include a robotic manipulator which must perform the same motion, such as assisting a human, repeatedly and accelerating cavities in particle accelerators which are turned on for a fraction of a second with given initial conditions and vary slowly due to temperature fluctuations. We present an approach that applies to systems of practical interest. The method presented here is model independent; does not require knowledge of the objective function; is robust to measurement noise; is applicable for any set of initial conditions; is applicable to simultaneously controlling an arbitrary number of parameters; and may be implemented with a broad range of continuous or discontinuous functions such as sine or square waves. For systems with convex cost functions we prove that our algorithm will produce controllers that approach the minimal cost. For linear systems we reproduce the cost minimizing linear quadratic regulator optimal controller that could have been designed analytically had the system and cost function been known. We demonstrate the effectiveness of the algorithm with simulation studies of noisy and time‐varying systems.