Policy Iteration Algorithms for DEC-POMDPs with discounted rewards
Author(s) -
Jilles Dibangoye,
Braim Chaid-Draa,
AbdelIllah Mouaddib
Publication year - 2009
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - partially observable markov decision process , markov decision process , computer science , scalability , markov process , mathematical optimization , process (computing) , algorithm , scale (ratio) , quality (philosophy) , observable , markov chain , machine learning , markov model , mathematics , statistics , philosophy , physics , epistemology , quantum mechanics , database , operating system
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom