z-logo
open-access-imgOpen Access
Planning in entropy-regularized Markov decision processes and games
Author(s) -
Jean-Bastien Grill,
Omar Darwiche Domingues,
Pierre Ménard,
Rémi Munos,
Michal Valko
Publication year - 2019
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - markov decision process , mathematical optimization , sample complexity , entropy (arrow of time) , computer science , markov process , markov chain , regularization (linguistics) , mathematics , bellman equation , computational complexity theory , partially observable markov decision process , cross entropy , operator (biology) , principle of maximum entropy , algorithm , artificial intelligence , machine learning , statistics , physics , biochemistry , chemistry , quantum mechanics , repressor , transcription factor , gene

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here