Interactive Value Iteration for Markov Decision Processes with Unknown Rewards
Author(s) -
Paul Weng,
Bruno Zanuttini
Publication year - 2013
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - markov decision process , computer science , task (project management) , set (abstract data type) , preference , markov process , value (mathematics) , tutor , markov chain , function (biology) , mathematical optimization , artificial intelligence , theoretical computer science , machine learning , mathematics , statistics , management , evolutionary biology , programming language , economics , biology
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom