
FADQN: A Heuristic Reinforcement Learning Mechanism for UAV Path Planning in Unknown Environment
Author(s) -
Wei Sun,
Qi Wang,
Jing Huan,
Hualong Yu,
Shang Gao
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3591457
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Path planning remains a focal point in Unmanned Aerial Vehicle (UAV) research, with autonomous path planning in unknown environments emerging as a particularly active area. Deep Reinforcement Learning (DRL) has become a standard approach for addressing path planning challenges in such environments. However, traditional DRL algorithms often suffer from slow convergence. To address this issue, this paper introduces an enhanced deep reinforcement learning algorithm, termed Firefly Algorithm-enhanced Deep Q-Network (FADQN). In FADQN, the action output from the Firefly Algorithm is integrated into the Deep Q-Network to guide early action selection, thereby accelerating convergence. To balance exploration and exploitation, an adaptive ε-greedy strategy is proposed. Furthermore, a prioritized experience replay mechanism is employed to increase the likelihood of the agent learning from critical experiences. Simulation experiments demonstrate that FADQN achieves shorter paths (reducing the path length by 7.32% in complex environments), fewer turns, faster and more stable convergence, and higher success rates across various environments compared to traditional DQN. These results validate the effectiveness of FADQN in solving UAV path planning problems.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom