Reinforcement learning based closed‐loop reference model adaptive flight control system design

Author(s) - Yuksek Burak, Inalhan Gokhan

Publication year - 2021

Publication title - International Journal of Adaptive Control and Signal Processing

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.73

H-Index - 66

eISSN - 1099-1115

pISSN - 0890-6327

DOI - 10.1002/acs.3181

Subject(s) - reinforcement learning, computer science, benchmark (surveying), control theory (sociology), parametric statistics, reference model, control engineering, transient (computer programming), adaptive control, flight simulator, adaptation (eye), engineering, simulation, control (management), artificial intelligence, physics, mathematics, geodesy, software engineering, statistics, optics, geography, operating system

Summary

In this study, we present a reinforcement learning (RL)‐based flight control system design method that improves the transient response of a closed‐loop reference model (CRM) adaptive control system. The method, referred to as RL‐CRM, generates a dynamic adaptation strategy by applying RL to the variable factor in the feedback path gain matrix of the reference model. An actor‐critic RL agent is designed using performance‐driven reward functions and tracking error observations from the environment. In the training phase, a deep deterministic policy gradient algorithm is used to learn the time‐varying adaptation strategy of the design parameter in the reference model feedback gain matrix. The proposed control structure makes it possible to learn numerous adaptation strategies across a wide range of flight and vehicle conditions, rather than having them driven by high‐fidelity simulators, flight testing, and real flight operations. The performance of the proposed system was evaluated on an identified and verified mathematical model of an agile quadrotor platform. Monte‐Carlo simulations and worst‐case analysis were also performed on a benchmark helicopter example model. Compared with classical model reference adaptive control and CRM‐adaptive control system designs, the proposed RL‐CRM adaptive flight control system design improves the transient response on all associated metrics and can operate over a wide range of parametric uncertainties.
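To make the role of the reference model feedback gain concrete, the following is a minimal sketch on a scalar plant, not the authors' implementation. It simulates a textbook CRM adaptive controller in which the feedback gain `ell` is recomputed at every step from the tracking error. All names and numbers here (`simulate`, `error_scheduled_gain`, the plant and model parameters, the adaptation rate) are assumptions chosen for illustration. In particular, `error_scheduled_gain` is a hand-written placeholder standing in for the trained actor; the paper learns this mapping with a deep deterministic policy gradient agent and applies it to a gain matrix.

```python
import numpy as np


def error_scheduled_gain(e, l_max=10.0, e_scale=0.2):
    """Hypothetical stand-in for the learned policy: maps the tracking
    error to a non-negative reference model feedback gain."""
    return l_max * min(1.0, abs(e) / e_scale)


def simulate(gain_policy, a=1.0, b=1.0, a_m=-2.0, b_m=2.0,
             gamma=10.0, dt=1e-3, t_end=10.0):
    """Forward-Euler simulation of a scalar CRM adaptive control loop.

    Plant:            x_dot   = a * x + b * u        (a unknown to the controller)
    Reference model:  x_m_dot = a_m * x_m + b_m * r + ell * (x - x_m)
    Controller:       u       = k_x * x + k_r * r
    Setting ell = 0 recovers classical model reference adaptive control.
    """
    n = int(round(t_end / dt))
    x = x_m = 0.0          # plant and reference model states
    k_x = k_r = 0.0        # adaptive gains, initialised with no prior knowledge
    r = 1.0                # unit step reference command
    x_hist, xm_hist, e_hist = np.empty(n), np.empty(n), np.empty(n)

    for i in range(n):
        e = x - x_m                      # tracking error
        ell = gain_policy(e)             # time-varying CRM feedback gain
        u = k_x * x + k_r * r

        x_dot = a * x + b * u
        xm_dot = a_m * x_m + b_m * r + ell * e
        # Gradient adaptation laws; these assume the sign of b is known (b > 0).
        kx_dot = -gamma * x * e
        kr_dot = -gamma * r * e

        x_hist[i], xm_hist[i], e_hist[i] = x, x_m, e

        x += dt * x_dot
        x_m += dt * xm_dot
        k_x += dt * kx_dot
        k_r += dt * kr_dot

    return x_hist, xm_hist, e_hist


# Classical MRAC (ell = 0) versus CRM with the placeholder gain schedule.
_, _, e_mrac = simulate(lambda e: 0.0)
_, _, e_crm = simulate(error_scheduled_gain)
print("peak |e|, MRAC:", np.max(np.abs(e_mrac)))
print("peak |e|, CRM :", np.max(np.abs(e_crm)))
```

Passing the gain as a callable keeps the control loop unchanged when the placeholder is swapped for a trained policy; under this sketch's assumptions, any policy that returns a non-negative gain leaves the error dynamics at least as damped as the open-loop reference model case.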
