Iterative Learning without Reinforcement or Reward for Multijoint Movements: A Revisit of Bernstein's DOF Problem on Dexterity
Author(s) -
Suguru Arimoto,
Masahiro Sekimoto,
Kenji Tahara
Publication year - 2010
Publication title -
journal of robotics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.303
H-Index - 14
eISSN - 1687-9619
pISSN - 1687-9600
DOI - 10.1155/2010/217867
Subject(s) - computer science , kinematics , inverse kinematics , reinforcement learning , robotics , artificial intelligence , robotic arm , trajectory , degrees of freedom (physics and chemistry) , trace (psycholinguistics) , robot , control theory (sociology) , linguistics , philosophy , physics , control (management) , classical mechanics , quantum mechanics , astronomy
A robot designed to mimic a human becomes kinematically redundant and its total degrees of freedom becomes larger than the number of physical variables required for describing a given task. Kinematic redundancy may contribute to enhancement of dexterity and versatility but it incurs a problem of ill-posedness of inverse kinematics from the taskspace to the joint space. This ill-posedness was originally found by Bernstein, who tried to unveil the secret of thecentral nervous system and how nicely it coordinates a skeletomotor system with many DOFs interacting in complex ways. Inthe history of robotics research, such ill-posedness has not yet been resolved directly but circumvented by introducingan artificial performance index and determining uniquely an inverse kinematics solution by minimization. This paper tacklessuch Bernstein's problem and proposes a new method for resolving the ill-posedness in a natural way without invokingany artificial index. First, given a curve on a horizontal plane for a redundant robot arm whose endpoint is imposed to tracethe curve, the existence of a unique ideal joint trajectory is proved. Second, such a uniquely determined motion can beacquired eventually as a joint control signal through iterative learning without reinforcement or reward
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom