Using temporal neighborhoods to adapt function approximators in reinforcement learning
Author(s) -
R. Matthew Kretchmar,
Charles W. Anderson
Publication year - 1999
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-66069-0
DOI - 10.1007/bfb0098206
Subject(s) - reinforcement learning , curse of dimensionality , computer science , temporal difference learning , basis (linear algebra) , artificial intelligence , bellman equation , basis function , function (biology) , class (philosophy) , dimension (graph theory) , machine learning , mathematics , mathematical optimization , mathematical analysis , geometry , evolutionary biology , pure mathematics , biology
To avoid the curse of dimensionality, function approximators areused in reinforcement learning to learn value functions for individualstates. In order to make better use of computational resources(basis functions) many researchers are investigating ways to adaptthe basis functions during the learning process so that they betterfit the value-function landscape. Here we introduce temporal neighborhoods as small groups of states that experience frequent intragrouptransitions during...
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom