Processos de decisão de Markov com sensibilidade a risco com função de utilidade exponencial: Uma revisão sistemática da literatura
Author(s) -
E. M. de Freitas,
Karina Delgado,
Valdinei Silva
Publication year - 2017
Publication title -
anais do simpósio brasileiro de sistemas de informação (sbsi)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/sbsi.2017.6045
Subject(s) - markov decision process , markov process , markov chain , computer science , humanities , partially observable markov decision process , process (computing) , decision process , philosophy , mathematics , statistics , machine learning , engineering , programming language , process management
Markov Decision Process (MDP) has been used very eciently to solve sequential decision-making problems. There are problems in which dealing with the risks of the environment to obtain a reliable result is more important than maximizing the expected average return. MDPs that deal with this type of problem are called risk-sensitive Markov decision processes (RSMDP). This systematic review of the literature aims to identify the theoretical results and proposed algorithms to solve RSMDP problems that have an exponential utility function, evaluating their main characteristics, similarities, particularities and differences in order to allow the reader the knowledge of this tool of decision making for risk sensitive problems.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom