Premium
Reinforcement learning: Architectures and algorithms
Author(s) -
Kokar Mieczyslaw M.,
Reveliotis Spiridon A.
Publication year - 1993
Publication title -
international journal of intelligent systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.291
H-Index - 87
eISSN - 1098-111X
pISSN - 0884-8173
DOI - 10.1002/int.4550080805
Subject(s) - reinforcement learning , computer science , reinforcement , perception , process (computing) , artificial intelligence , error driven learning , state (computer science) , architecture , algorithm , engineering , psychology , operating system , art , visual arts , structural engineering , neuroscience
This article is related to the research effort of constructing an intelligent agent, i.e., a computer system that is able to sense its environment (world), reason utilizing its internal knowledge and execute actions upon the world (act). the specific part of this effor presented in this article is reinforcement learning , i.e., the process of acquiring new knowledge based upon an evaluative feedback , called reinforcement , received by tht agent through interactions with the world. This article has two objectives: (1) to give a compact overview of reinforcement learning, and (2) to show that the evolution of the reinforcement learning paradigm has been driven by the need for more efficient learning through the addition of more structure to the learning agent. Therefore, both main ideas of reinforcement learning are introduced, and structural solutions to reinforcemen learning are reviewed. Several architectural enhancements of the RL paradigm are discussed. These include incorporation of state information in the learning process, architectural solutions to learning with delayed reinforcement, dealing with structurally changing worlds through utilization of multiple models of the world, and focusing attention of the learning agent through active perception. the paper closes with an overview of directions for applications and for future research in this area. © 1993 John Wiley & Sons, Inc.