Open Access
Online Learning in Episodic Markovian Decision Processes by Relative Entropy Policy Search
Author(s) - Alexander Zimin, Gergely Neu
Publication year - 2013
Publication title - HAL (Le Centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - regret, Markov decision process, entropy (arrow of time), Markov process, computer science, finite state, Markov chain, state space, mathematics, artificial intelligence, mathematical optimization, machine learning, statistics, physics, quantum mechanics
