z-logo
open-access-imgOpen Access
Mixing-Time Regularized Policy Gradient
Author(s) -
Tetsuro Morimura,
Takayuki Osogami,
Tomoyuki Shirai
Publication year - 2014
Publication title -
proceedings of the ... aaai conference on artificial intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v28i1.9013
Subject(s) - mixing (physics) , markov chain , hitting time , reinforcement learning , computer science , temporal difference learning , markov process , mathematical optimization , markov property , mathematics , markov model , artificial intelligence , statistics , machine learning , mathematical analysis , physics , quantum mechanics

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here