
Approximate Policy Iteration Schemes: A Comparison
Author(s) - Scherrer, Bruno
Publication year - 2014
Publication title - HAL (le centre pour la communication scientifique directe)
Language(s) - Uncategorized
Resource type - Conference proceedings
Subject(s) - Markov decision process, dynamic programming, mathematical optimization, computer science, horizon, time horizon, mathematics, constant (computer programming), Markov process, statistics, geometry, programming language