
Episodic reinforcement learning in finite MDPs: Minimax lower bounds revisited
Author(s) - Omar Domingues, Pierre Ménard, Emilie Kaufmann, Michal Valko
Publication year - 2021
Publication title - hal (le centre pour la communication scientifique directe)
Language(s) - Uncategorized
Resource type - Conference proceedings
Subject(s) - reinforcement learning, minimax, computer science, mathematical optimization, artificial intelligence, mathematics