
Navigation behavioural decision-making of MASS based on deep reinforcement learning and artificial potential field
Author(s) -
Chengbo Wang,
Xinyu Zhang,
Jiawei Zhang,
Zhiguo Ding,
AN Lan-xuan
Publication year - 2019
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1357/1/012026
Subject(s) - reinforcement learning , obstacle avoidance , motion planning , artificial intelligence , computer science , collision avoidance , bellman equation , obstacle , machine learning , mobile robot , simulation , collision , mathematics , mathematical optimization , robot , geography , computer security , archaeology
To realize intelligent obstacle avoidance and local path decisions for maritime autonomous surface ships (MASS) in uncertain environments, a navigation behavioural decision-making model based on deep reinforcement learning (DRL) algorithm improved by artificial potential field (APF) is proposed. Based on the analysis of navigation decision system and perception principle, the action space, reward function, motion search strategy and action value function are designed respectively for the purpose of steering to collision avoidance. The navigation behavioural decision-making model for MASS is improved by adding the prior information, the gravitational potential field and the obstacle repulsion potential field to update the initial action state value function and search path. Python and Pygame modules are used to build a simulation chart. Effectiveness of the algorithm is verified, with Tianjin Xingang port as a study case. The simulation results show that the APF-DRL algorithm is better than the DRL algorithm in training iteration time and piloting decision path, which improves the self-learning ability of MASS, and can meet the requirements of MASS path decision and adaptive obstacle avoidance.