FADQN: A Heuristic Reinforcement Learning Mechanism for UAV Path Planning in Unknown Environment | Zendy

Wei Sun | Zendy; Qi Wang | Zendy; Jing Huan | Zendy; Hualong Yu | Zendy; Shang Gao | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

FADQN: A Heuristic Reinforcement Learning Mechanism for UAV Path Planning in Unknown Environment

Author(s) -

Wei Sun,

Qi Wang,

Jing Huan,

Hualong Yu,

Shang Gao

Publication year - 2025

Publication title -

ieee access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3591457

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

Path planning remains a focal point in Unmanned Aerial Vehicle (UAV) research, with autonomous path planning in unknown environments emerging as a particularly active area. Deep Reinforcement Learning (DRL) has become a standard approach for addressing path planning challenges in such environments. However, traditional DRL algorithms often suffer from slow convergence. To address this issue, this paper introduces an enhanced deep reinforcement learning algorithm, termed Firefly Algorithm-enhanced Deep Q-Network (FADQN). In FADQN, the action output from the Firefly Algorithm is integrated into the Deep Q-Network to guide early action selection, thereby accelerating convergence. To balance exploration and exploitation, an adaptive ε-greedy strategy is proposed. Furthermore, a prioritized experience replay mechanism is employed to increase the likelihood of the agent learning from critical experiences. Simulation experiments demonstrate that FADQN achieves shorter paths (reducing the path length by 7.32% in complex environments), fewer turns, faster and more stable convergence, and higher success rates across various environments compared to traditional DQN. These results validate the effectiveness of FADQN in solving UAV path planning problems.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research