z-logo
open-access-imgOpen Access
Adaptive Step-Size for Online Temporal Difference Learning
Author(s) -
William Dabney,
Andrew G. Barto
Publication year - 2021
Publication title -
proceedings of the aaai conference on artificial intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v26i1.8313
Subject(s) - upper and lower bounds , heuristic , computer science , online algorithm , range (aeronautics) , adaptive learning , gradient descent , temporal difference learning , function (biology) , function approximation , algorithm , key (lock) , artificial intelligence , mathematical optimization , mathematics , reinforcement learning , artificial neural network , mathematical analysis , materials science , evolutionary biology , composite material , biology , computer security

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom