z-logo
Premium
Markov decision processes with restricted observations: Finite horizon case
Author(s) -
Serin Yasemin,
Avsar Zeynep Muge
Publication year - 1997
Publication title -
naval research logistics (nrl)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.665
H-Index - 68
eISSN - 1520-6750
pISSN - 0894-069X
DOI - 10.1002/(sici)1520-6750(199708)44:5<439::aid-nav3>3.0.co;2-5
Subject(s) - unobservable , markov decision process , observability , partially observable markov decision process , observable , markov process , state (computer science) , mathematical economics , mathematical optimization , decision problem , time horizon , markov chain , horizon , mathematics , computer science , econometrics , algorithm , statistics , physics , geometry , quantum mechanics
In this article we consider a Markov decision process subject to the constraints that result from some observability restrictions. We assume that the state of the Markov process under consideration is unobservable. The states are grouped so that the group that a state belongs to is observable. So, we want to find an optimal decision rule depending on the observable groups instead of the states. This means that the same decision applies to all the states in the same group. We prove that a deterministic optimal policy exists for the finite horizon. An algorithm is developed to compute policies minimizing the total expected discounted cost over a finite horizon. © 1997 John Wiley & Sons, Inc. Naval Research Logistics 44 : 439–456, 1997

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here