Sequential Monte Carlo in reachability heuristics for probabilistic planning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Sequential Monte Carlo in reachability heuristics for probabilistic planning

Authors:	Daniel Bryce Subbarao Kambhampati David E. Smith

Affiliation:	1. SRI International, Artificial Intelligence Center, 333 Ravenswood Ave, Menlo Park, CA 94025, USA;2. Arizona State University, Department of Computer Science and Engineering, Brickyard Suite 501, 699 South Mill Avenue, Tempe, AZ 85281, USA;3. NASA Ames Research Center, Intelligent Systems Division, MS 269-2 Moffett Field, CA 94035-1000, USA

Abstract:	Some of the current best conformant probabilistic planners focus on finding a fixed length plan with maximal probability. While these approaches can find optimal solutions, they often do not scale for large problems or plan lengths. As has been shown in classical planning, heuristic search outperforms bounded length search (especially when an appropriate plan length is not given a priori). The problem with applying heuristic search in probabilistic planning is that effective heuristics are as yet lacking.In this work, we apply heuristic search to conformant probabilistic planning by adapting planning graph heuristics developed for non-deterministic planning. We evaluate a straight-forward application of these planning graph techniques, which amounts to exactly computing a distribution over many relaxed planning graphs (one planning graph for each joint outcome of uncertain actions at each time step). Computing this distribution is costly, so we apply Sequential Monte Carlo (SMC) to approximate it. One important issue that we explore in this work is how to automatically determine the number of samples required for effective heuristic computation. We empirically demonstrate on several domains how our efficient, but sometimes suboptimal, approach enables our planner to solve much larger problems than an existing optimal bounded length probabilistic planner and still find reasonable quality solutions.

Keywords:
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏