
Batch Policy Iteration Algorithms for Continuous Domains
Author(s) - Bilal Piot, Matthieu Geist, Olivier Pietquin
Publication year - 2016
Publication title - HAL (Le Centre pour la Communication Scientifique Directe)
Language(s) - Uncategorized
Resource type - Conference proceedings
Subject(s) - Markov decision process, reinforcement learning, computer science, action (physics), algorithm, Markov process, adaptation (eye), mathematical optimization, state (computer science), hidden Markov model, mathematics, artificial intelligence, statistics, physics, quantum mechanics, optics