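Opponent modeling based on interactive dynamic influence diagrams
Cite this article: LUO Jian, WU He. Opponent modeling based on interactive dynamic influence diagram[J]. Control and Decision (控制与决策), 2016, 31(4): 635-639
Authors: LUO Jian, WU He
Affiliation: School of Information Science and Technology, Xiamen University, Xiamen 361005, Fujian, China.
Funding:

National Natural Science Foundation of China (61375070); Major Science and Technology Project of Fujian Province (2011H6027).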

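Abstract (translated from the Chinese):

In a competitive environment, limited resources create conflicts of interest among agents, so an agent needs to build a model of its opponent and predict the opponent's behavior accurately in order to choose a strategy that benefits itself. Interactive dynamic influence diagrams are used to model an unknown opponent: the opponent's candidate models are stored in the model node, and the belief over them is updated over time. Using the observed actions of the opponent, candidate models are eliminated step by step from the model space by means of "observation-action" sequences until the opponent's true model is identified. Experimental results show that the proposed algorithm performs well, which supports its practicality.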



Keywords (translated from the Chinese):

interactive dynamic influence diagram|multi-agent|opponent modeling|policy tree

Received: 2015-03-14
Revised: 2015-07-21

Opponent modeling based on interactive dynamic influence diagram
LUO Jian, WU He. Opponent modeling based on interactive dynamic influence diagram[J]. Control and Decision, 2016, 31(4): 635-639
Authors: LUO Jian, WU He
Abstract:

In a competitive environment, limited resources lead to conflicts of interest among agents, so it is necessary to build a model of the opponent and to predict its behavior accurately in order to devise strategies to one's own advantage. Interactive dynamic influence diagrams are used to model the unknown opponent: candidate models of the opponent are kept in the model node, which is updated over time. Then, combined with the observed actions of the opponent, candidate models in the model space are pruned using ‘observation-action’ sequences until the true model of the opponent is identified. The experimental results show the effectiveness and feasibility of the proposed algorithm.
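The pruning idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm and does not build an interactive dynamic influence diagram. It assumes that each candidate model has already been solved into a deterministic policy tree, that the opponent's observation sequence is known to the modeling agent, and that a candidate is discarded as soon as its tree disagrees with an observed action. All names (`PolicyNode`, `predicted_action`, `prune`, the toy candidates and actions) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class PolicyNode:
    """Node of a deterministic policy tree: an action plus one subtree per observation."""
    action: str
    children: Dict[str, "PolicyNode"] = field(default_factory=dict)


def predicted_action(root: PolicyNode, observations: List[str]) -> Optional[str]:
    """Follow the observation sequence down the tree and return the action found there."""
    node = root
    for obs in observations:
        nxt = node.children.get(obs)
        if nxt is None:  # the tree has no branch for this observation
            return None
        node = nxt
    return node.action


def prune(beliefs: Dict[str, float], trees: Dict[str, PolicyNode],
          observations: List[str], observed_action: str) -> Dict[str, float]:
    """Drop candidates that contradict the observed action, then renormalize the belief."""
    kept = {name: p for name, p in beliefs.items()
            if predicted_action(trees[name], observations) == observed_action}
    total = sum(kept.values())
    if total == 0:  # every candidate was contradicted
        return {}
    return {name: p / total for name, p in kept.items()}


# Toy candidate models (assumed for illustration only).
trees = {
    "aggressive": PolicyNode("attack", {"weak": PolicyNode("attack"),
                                        "strong": PolicyNode("attack")}),
    "cautious": PolicyNode("wait", {"weak": PolicyNode("attack"),
                                    "strong": PolicyNode("retreat")}),
    "opportunist": PolicyNode("attack", {"weak": PolicyNode("attack"),
                                         "strong": PolicyNode("retreat")}),
}
beliefs = {name: 1.0 / len(trees) for name in trees}

# Step 0: the opponent's first action is "attack".
step0 = prune(beliefs, trees, [], "attack")
# Step 1: the opponent observes "strong" and then plays "retreat".
step1 = prune(step0, trees, ["strong"], "retreat")
print(step0)
print(step1)
```

In this toy run the first action rules out the `cautious` candidate and the second leaves only `opportunist`. A hard match/mismatch test is the simplest choice; a softer variant would weight each candidate by the likelihood it assigns to the observed action instead of discarding it outright.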

Keywords:

interactive dynamic influence diagram|multi-agent|opponent modeling|policy tree
