NEARLY OPTIMAL POLICIES AND STOPPING TIMES IN MARKOV DECISION PROCESSES WITH GENERAL REWARDS
Author(s) - Nagata Furukawa
Publication year - 1980
Publication title - Bulletin of Mathematical Statistics
Language(s) - English
Resource type - Journals
ISSN - 0007-4993
DOI - 10.5109/13142
Subject(s) - Markov decision process, optimal stopping, stopping time, mathematics, Markov chain, Markov process, mathematical optimization, computer science, statistics