Acceleration of Reinforcement Learning with Incomplete Prior Information
Author(s) -
Kento Terashima,
Hirotaka Takano,
Junichi Murata
Publication year - 2013
Publication title -
journal of advanced computational intelligence and intelligent informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.172
H-Index - 20
eISSN - 1883-8014
pISSN - 1343-0130
DOI - 10.20965/jaciii.2013.p0721
Subject(s) - reinforcement learning , computer science , forgetting , acceleration , prior information , artificial intelligence , process (computing) , machine learning , cognitive psychology , psychology , classical mechanics , physics , operating system
Reinforcement learning is applicable to complex or unknown problems because the solution search process is done by trial-and-error. However, the calculation time for the trial-and-error search becomes larger as the scale of the problem increases. Therefore, in order to decrease calculation time, some methods have been proposed using the prior information on the problem. This paper improves a previously proposed method utilizing options as prior information. In order to increase the learning speed even with wrong options, methods for option correction by forgetting the policy and extending initiation sets are proposed.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom