Navigation behavioural decision-making of MASS based on deep reinforcement learning and artificial potential field | Zendy

Chengbo Wang | Zendy; Xinyu Zhang | Zendy; Jiawei Zhang | Zendy; Zhiguo Ding | Zendy; AN Lan-xuan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Navigation behavioural decision-making of MASS based on deep reinforcement learning and artificial potential field

Author(s) -

Chengbo Wang,

Xinyu Zhang,

Jiawei Zhang,

Zhiguo Ding,

AN Lan-xuan

Publication year - 2019

Publication title -

journal of physics. conference series

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.21

H-Index - 85

eISSN - 1742-6596

pISSN - 1742-6588

DOI - 10.1088/1742-6596/1357/1/012026

Subject(s) - reinforcement learning , obstacle avoidance , motion planning , artificial intelligence , computer science , collision avoidance , bellman equation , obstacle , machine learning , mobile robot , simulation , collision , mathematics , mathematical optimization , robot , geography , computer security , archaeology

To realize intelligent obstacle avoidance and local path decisions for maritime autonomous surface ships (MASS) in uncertain environments, a navigation behavioural decision-making model based on deep reinforcement learning (DRL) algorithm improved by artificial potential field (APF) is proposed. Based on the analysis of navigation decision system and perception principle, the action space, reward function, motion search strategy and action value function are designed respectively for the purpose of steering to collision avoidance. The navigation behavioural decision-making model for MASS is improved by adding the prior information, the gravitational potential field and the obstacle repulsion potential field to update the initial action state value function and search path. Python and Pygame modules are used to build a simulation chart. Effectiveness of the algorithm is verified, with Tianjin Xingang port as a study case. The simulation results show that the APF-DRL algorithm is better than the DRL algorithm in training iteration time and piloting decision path, which improves the self-learning ability of MASS, and can meet the requirements of MASS path decision and adaptive obstacle avoidance.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore