Reward-Weighted Regression Converges to a Global Optimum
Author(s) -
Miroslav Štrupl,
Francesco Faccio,
Dylan R. Ashley,
Rupesh K. Srivastava,
Jürgen Schmidhuber
Publication year - 2022
Publication title -
proceedings of the aaai conference on artificial intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v36i8.20811
Subject(s) - monotonic function , reinforcement learning , maximization , convergence (economics) , mathematical optimization , function (biology) , mathematics , bellman equation , importance sampling , regression , computer science , state space , statistics , artificial intelligence , economics , mathematical analysis , evolutionary biology , monte carlo method , biology , economic growth
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom