Human-aware Robot Collaborative Task Planning using Artificial Potential Field and DQN Reinforcement Learning | Zendy

P Jayesh | Zendy; Sam Altnji | Zendy; Karthick Thiyagarajan | Zendy; Jogesh S Nanda | Zendy; Abhijith Biswas | Zendy; Abhra Roy Chowdhury | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Human-aware Robot Collaborative Task Planning using Artificial Potential Field and DQN Reinforcement Learning

Author(s) -

P Jayesh,

Sam Altnji,

Karthick Thiyagarajan,

Jogesh S Nanda,

Abhijith Biswas,

Abhra Roy Chowdhury

Publication year - 2025

Publication title -

ieee access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3595995

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

This paper presents a novel way for Robot-Robot-Human interaction in a shared workspace for collaborative tasks and uses a multi-modal means of communication that includes hand gestures, voice commands, end-effector gestures, and marker tracking. The system consists of a human operator working along with a Task robot (UR5) and a helper robot (OpenManipulatorX) to perform assembly and disassembly tasks. A Deep Q Network (DQN) reinforcement learning model is used to train the robot to perform the goal reaching task while avoiding obstacles to ensure safety. The DQN algorithm makes use of the end-effector position and the relative positions with the goal and obstacles to train a policy that guides the robot arm safely. Then 4 different training models are created and their ability to avoid obstacles and reach the goal are compared along with the point-to-point Bezier interpolation path planning method in different scenarios such as varying height, size, and number of obstacles. The proposed system has been simulated and then experimentally validated. Experimental results show that DQN trained model performed better than Bezier interpolation in reaching the final goal position with an accuracy of 74mm while avoiding obstacles at the same time in a shared environment. It is also observed that of the different trained models, the model with a larger action space and reduced observation space gave better results compared to others in terms of accuracy and goal completion rate. Also, from experimental data its observed that Improved Artificial Potential Field (IAPF) only took 4.7s as the median time to reach the goal whereas Goal Directed Approach (GDA) took 7.62s and Rapidly Exploring Random Tree Star (RRT*) took 6.22s in different scenarios.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research