Decentralized Coordinated Motion Control of Two Hydraulic Actuators Handling a Common Object
Author(s) -
Mark Karpenko,
Nariman Sepehri,
John Anderson
Publication year - 2007
Publication title -
journal of dynamic systems measurement and control
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.528
H-Index - 89
eISSN - 1528-9028
pISSN - 0022-0434
DOI - 10.1115/1.2764516
Subject(s) - reinforcement learning , actuator , trajectory , computer science , task (project management) , object (grammar) , control theory (sociology) , position (finance) , artificial intelligence , control engineering , control (management) , engineering , physics , systems engineering , finance , astronomy , economics
In this paper, reinforcement learning is applied to coordinate, in a decentralized fashion, the motions of a pair of hydraulic actuators whose task is to firmly hold and move an object along a specified trajectory under conventional position control. The learning goal is to reduce the interaction forces acting on the object that arise due to inevitable positioning errors resulting from the imperfect closed-loop actuator dynamics. Each ac-tuator is therefore outfitted with a reinforcement learning neural network that modifies a centrally planned formation constrained position trajectory in response to the locally measured interaction force. It is shown that the actuators, which form a multiagent learning system, can learn decentralized control strategies that reduce the object inter-action forces and thus greatly improve their coordination on the manipulation task. However, the problem of credit assignment, a common difficulty in multiagent learning systems, prevents the actuators from learning control strategies where each actuator contributes equally to reducing the interaction force. This problem is resolved in this paper via the periodic communication of limited local state information between the reinforcement learning actuators. Using both simulations and experiments, this paper examines some of the issues pertaining to learning in dynamic multiagent environments and establishes reinforcement learning as a potential technique for coordinating several nonlinear hydraulic manipulators performing a common task. DOI: 10.1115/1.2764516
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom