Event-Triggered Optimal Neuro-Controller Design With Reinforcement Learning for Unknown Nonlinear Systems | Zendy

Xiong Yang | Zendy; Haibo He | Zendy; Derong Liu | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Event-Triggered Optimal Neuro-Controller Design With Reinforcement Learning for Unknown Nonlinear Systems

Author(s) -

Xiong Yang,

Haibo He,

Derong Liu

Publication year - 2017

Publication title -

ieee transactions on systems man and cybernetics systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.261

H-Index - 64

eISSN - 2168-2232

pISSN - 2168-2216

DOI - 10.1109/tsmc.2017.2774602

Subject(s) - control theory (sociology) , identifier , computer science , controller (irrigation) , reinforcement learning , artificial neural network , nonlinear system , feed forward , gradient descent , event (particle physics) , stability (learning theory) , optimal control , lyapunov function , artificial intelligence , control engineering , control (management) , mathematics , engineering , mathematical optimization , machine learning , physics , quantum mechanics , agronomy , biology , programming language

This paper develops an optimal control scheme for continuous-time unknown nonlinear systems using the event-triggering mechanism. Different from designing controllers using the time-triggering mechanism, the event-triggered controller is updated only when the system state deviates more than a certain threshold from a prescribed value. To obtain the event-triggered optimal controller, we develop an identifier-critic architecture under the framework of reinforcement learning. The identifier network, composed of a feedforward neural network (FNN), aims to derive the knowledge of unknown system dynamics, and the critic network, constituted of an FNN, intends to derive the event-triggered optimal controller. The identifier network is tuned via the combination of a standard back-propagation algorithm and an ${e}$ -modification method, and the critic network is updated using a modification of the gradient descent method. By introducing an additional stability term to update the critic network, the initial admissible control is no longer required. Meanwhile, by using historical and instantaneous state data together, the persistence of excitation condition is relaxed. A stability analysis of the closed-loop system is provided based on the Lyapunov method. The effectiveness of the proposed designs is illustrated through simulations of a nonlinear example and a single link robot arm system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research