NUMERICAL ENCLOSURE FOR THE OPTIMAL THRESHOLD PROBABILITY IN DISCOUNTED MARKOV DECISION PROCESSES
Author(s) -
Kenji Toyonaga,
T. Mitsuhiro Nakao
Publication year - 2000
Publication title -
bulletin of informatics and cybernetics
Language(s) - English
Resource type - Journals
eISSN - 2435-743X
pISSN - 0286-522X
DOI - 10.5109/13494
Subject(s) - enclosure , mathematics , markov decision process , mathematical optimization , markov chain , statistics , markov process , computer science , telecommunications
There are various procedures to compute the optimal threshold probability in discounted Markov decision processes. In the actual numerical computation of an approximate optimal solution, the estimation of the discrepancy between the approximate solution and the exact solution is important. White(1993 b) derived such an error estimation for the value iteration method, however, this estimation is not actually in the computable form. In this paper, we present a numerical enclosure method to compute the optimal threshold probability, that guarantees a rigorous a posteriori error bound.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom