Open Access
Topological Order Value Iteration Algorithm for Solving Probabilistic Planning
Author(s) - Xiaofei Liu, Mingjie Li, Qingxin Nie
Publication year - 2013
Publication title - Communications and Network
Language(s) - English
Resource type - Journals
eISSN - 1949-2421
pISSN - 1947-3826
DOI - 10.4236/cn.2013.51b020
Subject(s) - computer science, markov decision process, heuristics, probabilistic logic, exploit, backup, partially observable markov decision process, mathematical optimization, state space, state (computer science), algorithm, space (punctuation), sequence (biology), markov chain, markov process, artificial intelligence, markov model, machine learning, mathematics, statistics, computer security, database, biology, genetics, operating system
AI researchers typically formulate problems of probabilistic planning under uncertainty as Markov Decision Processes (MDPs). Value Iteration is an inefficient algorithm for MDPs, because it puts the majority of its effort into backing up the entire state space, which turns out to be unnecessary in many cases. To overcome this problem, many approaches have been proposed. Among them, LAO*, LRTDP and HDP are the state of the art. All of these use reachability analysis and heuristics to avoid some unnecessary backups. However, none of these approaches fully exploits the graphical features of the MDP or uses these features to yield the best backup sequence of the state space. We introduce an improved algorithm named Topological Order Value Iteration (TOVI) that can circumvent the problem of unnecessary backups by detecting the structure of MDPs and backing up states based on topological sequences. The experimental results demonstrate the effectiveness and excellent performance of our algorithm.
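
The abstract does not spell out how TOVI works internally, so the following is only a minimal sketch of the general idea it describes: detect the graph structure of an MDP and back up states in a topological sequence. It assumes one common way of doing this, namely splitting the state graph into strongly connected components and solving them in reverse topological order, so that each component is backed up only after everything it can reach has converged. The MDP layout (a dict mapping state to actions to lists of `(probability, next_state, cost)` triples), the function names, and the `gamma` and `eps` parameters are all illustrative assumptions, not the authors' implementation.

```python
def strongly_connected_components(states, successors):
    """Iterative Tarjan's algorithm.

    Returns components in reverse topological order: a component is
    emitted only after every component reachable from it.
    """
    index, low = {}, {}
    stack, on_stack = [], set()
    sccs = []
    counter = 0
    for root in states:
        if root in index:
            continue
        index[root] = low[root] = counter
        counter += 1
        stack.append(root)
        on_stack.add(root)
        work = [(root, iter(successors[root]))]
        while work:
            node, it = work[-1]
            descended = False
            for nxt in it:
                if nxt not in index:
                    # First visit: descend into the successor.
                    index[nxt] = low[nxt] = counter
                    counter += 1
                    stack.append(nxt)
                    on_stack.add(nxt)
                    work.append((nxt, iter(successors[nxt])))
                    descended = True
                    break
                if nxt in on_stack:
                    low[node] = min(low[node], index[nxt])
            if descended:
                continue
            work.pop()
            if work:
                parent = work[-1][0]
                low[parent] = min(low[parent], low[node])
            if low[node] == index[node]:
                # node is the root of a component: pop it off the stack.
                comp = []
                while True:
                    s = stack.pop()
                    on_stack.discard(s)
                    comp.append(s)
                    if s == node:
                        break
                sccs.append(comp)
    return sccs


def topological_value_iteration(mdp, gamma=1.0, eps=1e-8):
    """Minimise expected cost, solving one component at a time.

    Assumed layout: mdp[state][action] = [(prob, next_state, cost), ...].
    A state with no actions is treated as terminal with value 0.
    """
    successors = {
        s: {ns for outcomes in acts.values() for _, ns, _ in outcomes}
        for s, acts in mdp.items()
    }
    values = {s: 0.0 for s in mdp}
    for comp in strongly_connected_components(list(mdp), successors):
        while True:
            residual = 0.0
            for s in comp:
                if not mdp[s]:
                    continue  # terminal state keeps value 0
                best = min(
                    sum(p * (c + gamma * values[ns]) for p, ns, c in outcomes)
                    for outcomes in mdp[s].values()
                )
                residual = max(residual, abs(best - values[s]))
                values[s] = best
            if residual < eps:
                break
    return values


# Hypothetical three-state example: s0 -> s1, s1 loops or reaches the goal.
mdp = {
    "s0": {"go": [(1.0, "s1", 1.0)]},
    "s1": {"try": [(0.5, "s1", 1.0), (0.5, "goal", 1.0)]},
    "goal": {},
}
values = topological_value_iteration(mdp)
```

In this toy problem each state forms its own component, so `goal` is fixed first, `s1` is iterated until its self-loop converges, and `s0` then needs only a single backup. Plain value iteration would instead sweep all three states on every pass.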
