
Regret Minimization in MDPs with Options without Prior Knowledge
Author(s) -
Ronan Fruit,
Matteo Pirotta,
Alessandro Lazaric,
Emma Brunskill
Publication year - 2017
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - regret , markov decision process , reinforcement learning , abstraction , computer science , mathematical optimization , term (time) , artificial intelligence , markov chain , machine learning , markov process , mathematics , statistics , philosophy , physics , epistemology , quantum mechanics