Premium
Evolutionary reinforcement learning system with time‐varying parameters
Author(s) -
Umesako Kosuke,
Obayashi Masanao,
Kobayashi Kunikazu
Publication year - 2006
Publication title -
electrical engineering in japan
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.136
H-Index - 28
eISSN - 1520-6416
pISSN - 0424-7760
DOI - 10.1002/eej.20170
Subject(s) - reinforcement learning , state space , variation (astronomy) , computer science , state (computer science) , dynamical system (definition) , closing (real estate) , artificial intelligence , evolutionary robotics , dynamical systems theory , robot , control theory (sociology) , control (management) , mathematics , physics , algorithm , statistics , quantum mechanics , astrophysics , political science , law
In this paper, an evolutionary reinforcement learning system with time‐varying parameters that can learn appropriate policy in dynamical POMDPs is proposed. The proposed system has time‐varying parameters that can be adjusted by using reinforcement learning. Hence, the system can adapt to the time variation of the dynamical environment even if its variation cannot be observed. In addition, the state space of the environment is divided evolutionarily. Thus, one need not divide the state space in advance. The efficacy of the proposed system is shown by mobile robot control simulation under the environment belonging to dynamical POMDPs. The environment is the passage that has gates iterate opening and closing. © 2006 Wiley Periodicals, Inc. Electr Eng Jpn, 156(1): 54–60, 2006; Published online in Wiley InterScience ( www.interscience.wiley.com ). DOI 10.1002/eej.20170