
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration
Author(s) - Priyank Agrawal, JingLin Chen, Nan Jiang
Publication year - 2021
Publication title - Proceedings of the ... AAAI Conference on Artificial Intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v35i8.16813
Subject(s) - regret, Markov decision process, mathematics, value (mathematics), reinforcement learning, clipping (morphology), mathematical optimization, upper and lower bounds, combinatorics, statistics, computer science, Markov process, artificial intelligence, mathematical analysis, linguistics, philosophy