Open Access

Opposition-Based Reinforcement Learning

Author(s) - Hamid R. Tizhoosh

Publication year - 2006

Publication title - Journal of Advanced Computational Intelligence and Intelligent Informatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.172

H-Index - 20

eISSN - 1883-8014

pISSN - 1343-0130

DOI - 10.20965/jaciii.2006.p0578

Subject(s) - reinforcement learning, computer science, expediting, artificial intelligence, a priori and a posteriori, probabilistic logic, grid, machine learning, opposition (politics), convergence (economics), learning classifier system, engineering, mathematics, politics, law, political science, philosophy, geometry, systems engineering, epistemology, economics, economic growth

Reinforcement learning is a machine intelligence scheme for learning in highly dynamic, probabilistic environments. By interacting with the environment, reinforcement agents learn optimal control policies, especially in the absence of a priori knowledge and/or a sufficiently large amount of training data. Despite its advantages, reinforcement learning suffers from a major drawback: high computational cost, because convergence to an optimal solution usually requires that all states be visited frequently to ensure that the policy is reliable. This is not always possible, owing to the complex, high-dimensional state spaces of many applications. This paper introduces opposition-based reinforcement learning, inspired by opposition-based learning, to speed up convergence. Considering opposite actions simultaneously enables individual states to be updated more than once, shortening exploration and expediting convergence. Three versions of the Q-learning algorithm are given as examples. Experimental results for grid world problems of different sizes demonstrate the superior performance of the proposed approach.
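The idea of updating a state more than once per step can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not give the three Q-learning variants, so everything below is an assumption made for illustration. In particular, the sketch assumes that the opposite of a move is the reverse direction, that the agent can query a known grid model for the outcome of the opposite action, and that the reward is 1.0 at the goal and -0.01 elsewhere. The names `step`, `q_update`, `train`, and `use_opposition` are hypothetical.

```python
import random

# Grid moves and their assumed opposites (up <-> down, left <-> right).
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
OPPOSITE = {"up": "down", "down": "up", "left": "right", "right": "left"}


def step(state, action, size, goal):
    """Move on a size x size grid; moves into a wall leave the agent in place."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    next_state = (min(max(row + d_row, 0), size - 1),
                  min(max(col + d_col, 0), size - 1))
    # Assumed reward scheme: 1.0 on reaching the goal, small penalty otherwise.
    reward = 1.0 if next_state == goal else -0.01
    return next_state, reward


def q_update(q, state, action, reward, next_state, goal, alpha, gamma):
    """Standard one-step Q-learning update; the goal is treated as terminal."""
    best_next = 0.0 if next_state == goal else max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


def train(size=5, episodes=200, alpha=0.5, gamma=0.9, epsilon=0.2,
          use_opposition=True, seed=0):
    rng = random.Random(seed)
    goal = (size - 1, size - 1)
    q = {((r, c), a): 0.0 for r in range(size) for c in range(size) for a in ACTIONS}
    for _ in range(episodes):
        state = (0, 0)
        for _ in range(4 * size * size):  # cap on episode length
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                action = rng.choice(list(ACTIONS))
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            next_state, reward = step(state, action, size, goal)
            q_update(q, state, action, reward, next_state, goal, alpha, gamma)
            if use_opposition:
                # Assumption: the outcome of the opposite action is obtained by
                # querying the grid model, so the same state is updated twice.
                opposite = OPPOSITE[action]
                opp_next, opp_reward = step(state, opposite, size, goal)
                q_update(q, state, opposite, opp_reward, opp_next, goal, alpha, gamma)
            state = next_state
            if state == goal:
                break
    return q
```

Calling `train(use_opposition=False)` runs plain Q-learning on the same grid, which gives a baseline for comparing how quickly the two Q-tables settle. The sketch makes no claim about the size of any speed-up.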
