
Human-aware Robot Collaborative Task Planning using Artificial Potential Field and DQN Reinforcement Learning
Author(s) -
P Jayesh,
Sam Altnji,
Karthick Thiyagarajan,
Jogesh S Nanda,
Abhijith Biswas,
Abhra Roy Chowdhury
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3595995
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
This paper presents a novel way for Robot-Robot-Human interaction in a shared workspace for collaborative tasks and uses a multi-modal means of communication that includes hand gestures, voice commands, end-effector gestures, and marker tracking. The system consists of a human operator working along with a Task robot (UR5) and a helper robot (OpenManipulatorX) to perform assembly and disassembly tasks. A Deep Q Network (DQN) reinforcement learning model is used to train the robot to perform the goal reaching task while avoiding obstacles to ensure safety. The DQN algorithm makes use of the end-effector position and the relative positions with the goal and obstacles to train a policy that guides the robot arm safely. Then 4 different training models are created and their ability to avoid obstacles and reach the goal are compared along with the point-to-point Bezier interpolation path planning method in different scenarios such as varying height, size, and number of obstacles. The proposed system has been simulated and then experimentally validated. Experimental results show that DQN trained model performed better than Bezier interpolation in reaching the final goal position with an accuracy of 74mm while avoiding obstacles at the same time in a shared environment. It is also observed that of the different trained models, the model with a larger action space and reduced observation space gave better results compared to others in terms of accuracy and goal completion rate. Also, from experimental data its observed that Improved Artificial Potential Field (IAPF) only took 4.7s as the median time to reach the goal whereas Goal Directed Approach (GDA) took 7.62s and Rapidly Exploring Random Tree Star (RRT*) took 6.22s in different scenarios.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom