
Interactive Value Iteration for Markov Decision Processes with Unknown Rewards
Author(s) -
Paul Weng,
Bruno Zanuttini
Publication year - 2013
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - markov decision process , computer science , value (mathematics) , markov process , markov chain , artificial intelligence , machine learning , mathematics , statistics