首页 | 本学科首页   官方微博 | 高级检索  
     

A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man
作者姓名:Greg FODERARO  Vikram RAJU  Silvia FERRARI
作者单位:Department of Mechanical Engineering and Materials Science, Duke University,Department of Mechanical Engineering and Materials Science, Duke University,Department of Mechanical Engineering and Materials Science, Duke University
基金项目:This work was supported by the National Science Foundation (No.ECS 0925407).
摘    要:This paper presents a model-based approximate λ-policy iteration approach using temporal differences for optimizing paths online for a pursuit-evasion problem, where an agent must visit several target positions within a region of interest while simultaneously avoiding one or more actively pursuing adversaries. This method is relevant to applications, such as robotic path planning, mobile-sensor applications, and path exposure. The methodology described utilizes cell decomposition to construct a decision tree and implements a temporal difference-based approximate λ-policy iteration to combine online learning with prior knowledge through modeling to achieve the objectives of minimizing the risk of being caught by an adversary and maximizing a reward associated with visiting target locations. Online learning and frequent decision tree updates allow the algorithm to quickly adapt to unexpected movements by the adversaries or dynamic environments. The approach is illustrated through a modified version of the video game Ms. Pac-Man, which is shown to be a benchmark example of the pursuit-evasion problem. The results show that the approach presented in this paper outperforms several other methods as well as most human players.

收稿时间:17 March 2009
修稿时间:2011/3/23 0:00:00
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
点击此处可从《控制理论与应用(英文版)》浏览原始摘要信息
点击此处可从《控制理论与应用(英文版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号