z-logo
open-access-imgOpen Access
Event-Triggered Optimal Neuro-Controller Design With Reinforcement Learning for Unknown Nonlinear Systems
Author(s) -
Xiong Yang,
Haibo He,
Derong Liu
Publication year - 2017
Publication title -
ieee transactions on systems man and cybernetics systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.261
H-Index - 64
eISSN - 2168-2232
pISSN - 2168-2216
DOI - 10.1109/tsmc.2017.2774602
Subject(s) - control theory (sociology) , identifier , computer science , controller (irrigation) , reinforcement learning , artificial neural network , nonlinear system , feed forward , gradient descent , event (particle physics) , stability (learning theory) , optimal control , lyapunov function , artificial intelligence , control engineering , control (management) , mathematics , engineering , mathematical optimization , machine learning , physics , quantum mechanics , agronomy , biology , programming language
This paper develops an optimal control scheme for continuous-time unknown nonlinear systems using the event-triggering mechanism. Different from designing controllers using the time-triggering mechanism, the event-triggered controller is updated only when the system state deviates more than a certain threshold from a prescribed value. To obtain the event-triggered optimal controller, we develop an identifier-critic architecture under the framework of reinforcement learning. The identifier network, composed of a feedforward neural network (FNN), aims to derive the knowledge of unknown system dynamics, and the critic network, constituted of an FNN, intends to derive the event-triggered optimal controller. The identifier network is tuned via the combination of a standard back-propagation algorithm and an ${e}$ -modification method, and the critic network is updated using a modification of the gradient descent method. By introducing an additional stability term to update the critic network, the initial admissible control is no longer required. Meanwhile, by using historical and instantaneous state data together, the persistence of excitation condition is relaxed. A stability analysis of the closed-loop system is provided based on the Lyapunov method. The effectiveness of the proposed designs is illustrated through simulations of a nonlinear example and a single link robot arm system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom