Merging with Extraction Method for Transfer Learning in Actor-Critic
Author(s) -
Toshiaki Takano,
Haruhiko Takase,
Hiroharu Kawanaka,
Shinji Tsuruoka
Publication year - 2011
Publication title -
journal of advanced computational intelligence and intelligent informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.172
H-Index - 20
eISSN - 1343-0130
pISSN - 1883-8014
DOI - 10.20965/jaciii.2011.p0814
Subject(s) - computer science , reinforcement learning , transfer of learning , task (project management) , artificial intelligence , reuse , negative transfer , process (computing) , machine learning , action (physics) , simple (philosophy) , transfer (computing) , ecology , linguistics , philosophy , physics , management , epistemology , quantum mechanics , parallel computing , first language , economics , biology , operating system
This paper aims to accelerate learning process of actor-critic method, which is one of the major reinforcement learning algorithms, by a transfer learning. Transfer learning accelerates learning processes for the target task by reusing knowledge of source policies for each source task. In general, it consists of a selection phase and a training phase. Agents select source policies that are similar to the target one without trial and error, and train the target task by referring selected policies. In this paper, we discuss the training phase, and the rest of the training algorithm is based on our previous method. We proposed the effective transfer method that consists of the extractionmethod and the mergingmethod. Agents extract action preferences that are related to reliable states, and state values that lead to preferred states. Extracted parameters are merged into the current parameters by taking weighted average. We apply the proposed algorithm to simple maze tasks, and show the effectiveness of the proposed method: reduce 16% episodes and 55% failures without transfer.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom