
IMED-RL: Regret optimal learning of ergodic Markov decision processes
Author(s) - Fabien Pesquerel, Odalric-Ambrym Maillard
Publication year - 2022
Publication title - HAL (Le Centre pour la Communication Scientifique Directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - regret, Markov decision process, ergodic theory, computer science, artificial intelligence, Markov chain, Markov process, machine learning, mathematical optimization, mathematics, statistics, mathematical analysis