Episodic reinforcement learning in finite MDPs: Minimax lower bounds revisited
Author(s) -
Omar Domingues,
Pierre Ménard,
Emilie Kaufmann,
Michal Vaľko
Publication year - 2021
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - Uncategorized
Resource type - Conference proceedings
Subject(s) - reinforcement learning , minimax , computer science , mathematical optimization , artificial intelligence , mathematics
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom