Learning Quadcopter Maneuvers with Concurrent Methods of Policy Optimization | Zendy

Pei-Hua Huang | Zendy; Osamu Hasegawa | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Learning Quadcopter Maneuvers with Concurrent Methods of Policy Optimization

Author(s) -

Pei-Hua Huang,

Osamu Hasegawa

Publication year - 2017

Publication title -

journal of advanced computational intelligence and intelligent informatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.172

H-Index - 20

eISSN - 1343-0130

pISSN - 1883-8014

DOI - 10.20965/jaciii.2017.p0639

Subject(s) - computer science , quadcopter , reinforcement learning , asynchronous communication , reliability (semiconductor) , artificial intelligence , control engineering , engineering , aerospace engineering , computer network , power (physics) , physics , quantum mechanics

This study presents an aerial robotic application of deep reinforcement learning that imparts an asynchronous learning framework and trust region policy optimization to a simulated quad-rotor helicopter (quadcopter) environment. In particular, we optimized a control policy asynchronously through interaction with concurrent instances of the environment. The control system was benchmarked and extended with examples to tackle continuous state-action tasks for the quadcoptor: hovering control and balancing an inverted pole. Performing these maneuvers required continuous actions for sensitive control of small acceleration changes of the quadcoptor, thereby maximizing the scalar reward of the defined tasks. The simulation results demonstrated an enhancement of the learning speed and reliability for the tasks.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research