首页 | 本学科首页   官方微博 | 高级检索  
     

逻辑马尔可夫决策编程和关系马尔可夫决策编程若干最新进展
引用本文:王蓁蓁,邢汉承,张志政,倪庆剑.逻辑马尔可夫决策编程和关系马尔可夫决策编程若干最新进展[J].计算机科学,2007,34(10):1-7.
作者姓名:王蓁蓁  邢汉承  张志政  倪庆剑
作者单位:东南大学计算机科学与工程学院,南京,210096
摘    要:逻辑马尔可夫决策过程和关系马尔可夫决策过程的引入,使得人们可能简洁地、陈述地表达复杂的马尔可夫决策过程。本文首先介绍有关逻辑马尔可夫决策过程和关系马尔可夫决策过程的概念,然后重点介绍它们与普通的马尔可夫决策过程根本不同的一些算法:①依赖于基本状态空间RL的转换法;②把Bellman方程推广到抽象状态空间的方法;③利用策略偏置空间寻求近似最优策略方法。最后对它们的研究现状进行总结及其对它们发展的一些展望。

关 键 词:逻辑马尔可夫决策过程  关系马尔可夫决策过程

Several New Advances of Logical Markov Decision Processes and Relational Markov Decision Processes
WANG Zhen-Zhen,XING Han-Cheng,ZHANG Zhi-Zheng,NI Qing-Jian.Several New Advances of Logical Markov Decision Processes and Relational Markov Decision Processes[J].Computer Science,2007,34(10):1-7.
Authors:WANG Zhen-Zhen  XING Han-Cheng  ZHANG Zhi-Zheng  NI Qing-Jian
Abstract:Using logical Markov decision processes (LDMDPs) and relational Markov decision processes (RMDPs) one can compactly and declaratively represent complex Markov decision processes.This paper firstly introduces central con- cepts of LOMDPs and RMDPs.Then several algorithms that are different from regular Markov decision processes are reviewed:1.The transition method relying on ground state space.2.A relational upgrade of the Bellman update oper- ation.3.Approximate policy iteration using policy bias space.Finally,the paper gives conclusions of recent work and suggests future work.In this way,people are able to get an intensive,comprehensive and in-depth understanding of LOMDPs and RMDPs.
Keywords:Logical Markov decision processes  Relational Markov decision processes
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号