Open Access
Adaptive Step-Size for Online Temporal Difference Learning
Author(s) -
William Dabney,
Andrew G. Barto
Publication year - 2021
Publication title -
proceedings of the ... aaai conference on artificial intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v26i1.8313
Subject(s) - upper and lower bounds , heuristic , computer science , online algorithm , range (aeronautics) , adaptive learning , gradient descent , temporal difference learning , function (biology) , function approximation , algorithm , key (lock) , artificial intelligence , mathematical optimization , mathematics , reinforcement learning , artificial neural network , mathematical analysis , materials science , evolutionary biology , composite material , biology , computer security