z-logo
open-access-imgOpen Access
Softened approximate policy iteration for Markov games
Author(s) -
Julien Pérolat,
Bilal Piot,
Matthieu Geist,
Bruno Scherrer,
Olivier Pietquin
Publication year - 2016
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - Uncategorized
Resource type - Conference proceedings
Subject(s) - computer science , residual , mathematical optimization , markov decision process , norm (philosophy) , minification , stability (learning theory) , markov chain , algorithm , markov process , mathematics , machine learning , statistics , political science , law

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom