A rationally oriented forgettable profit sharing | Zendy

Koujaku Sadamori | Zendy; Watanabe Kota | Zendy; Igarashi Hajima | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

A rationally oriented forgettable profit sharing

Author(s) -

Koujaku Sadamori,

Watanabe Kota,

Igarashi Hajima

Publication year - 2013

Publication title -

electronics and communications in japan

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.131

H-Index - 13

eISSN - 1942-9541

pISSN - 1942-9533

DOI - 10.1002/ecj.11461

Subject(s) - irrational number , rationality , forgetting , profit sharing , computer science , reinforcement learning , profit (economics) , mathematical optimization , operations research , artificial intelligence , mathematics , microeconomics , economics , law , finance , psychology , geometry , political science , cognitive psychology

Summary In this paper, the Rationally Oriented Forgettable Profit Sharing method ( RFPS ) for reinforcement learning is proposed. Although profit sharing ( PS ) provides good performances in real environments, its learning is often slow in long‐term tasks because it is difficult to determine the appropriate discount rate which satisfies the Miyazaki rational theorem. There are several rationality‐relaxed PS methods which work well for such tasks. However, these PS methods may result in many irrational loops. The proposed method fulfills rationality by forgetting the reinforced irrational loops. This method can be easily combined with ordinary PS methods and performs well in long‐term tasks. Simulation results show that the proposed method can learn more efficiently than conventional PS methods. © 2013 Wiley Periodicals, Inc. Electron Comm Jpn, 96(7): 11–18, 2013; Published online in Wiley Online Library ( wileyonlinelibrary.com ). DOI 10.1002/ecj.11461

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research