
Regret Bounds for Learning State Representations in Reinforcement Learning
Author(s) -
Ronald Ortner,
Matteo Pirotta,
Alessandro Lazaric,
Ronan Fruit,
Odalric-Ambrym Maillard
Publication year - 2019
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - regret , markov decision process , reinforcement learning , q learning , state space , representation (politics) , markov chain , computer science , markov process , artificial intelligence , state (computer science) , mathematical optimization , mathematics , machine learning , algorithm , statistics , politics , political science , law