Similar literature
20 similar articles found.
1.
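Regret-based reinforcement learning model for multi-agent conflict games   Total citations: 1, self-citations: 0, citations by others: 1
肖正, 张世永. Journal of Software (《软件学报》), 2008, 19(11): 2957-2967
For conflict games, this paper studies a rational and conservative action-selection method: minimizing the agent's regret in the worst case. Under this method, the loss that the agent's current policy may incur in the future is minimized, and a Nash-equilibrium mixed strategy can be obtained without any information about the other agents. Based on regret, a reinforcement learning model for conflict games in complex multi-agent environments is proposed, together with its algorithmic implementation. The model introduces a cross-entropy distance to build a belief-update process, which further improves action selection in conflict games. The convergence of the algorithm is verified on a Markov repeated-game model, and the relationship between beliefs and the optimal policy is analyzed. In addition, compared with the extended Q-learning algorithm under MMDP (multi-agent Markov decision process), the algorithm substantially reduces the number of conflicts, improves the coordination of agent behavior, raises system performance, and helps keep the system stable.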
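The worst-case regret criterion in this abstract can be sketched as follows. This is a simplified illustration restricted to pure actions; the paper itself concerns mixed strategies and a cross-entropy belief update, neither of which is reproduced here. The function name `minimax_regret_action` and the payoff table are hypothetical, not taken from the paper.

```python
def minimax_regret_action(payoff):
    """Return (action index, worst-case regret) for the minimax-regret action.

    payoff[a][s] is an assumed payoff of action a when the other agents'
    joint behavior is s (hypothetical table, for illustration only).
    """
    n_actions = len(payoff)
    n_states = len(payoff[0])
    # Best payoff achievable against each behavior of the other agents.
    best = [max(payoff[a][s] for a in range(n_actions)) for s in range(n_states)]
    # Worst-case regret of each action across all behaviors.
    worst = [
        max(best[s] - payoff[a][s] for s in range(n_states))
        for a in range(n_actions)
    ]
    a_star = min(range(n_actions), key=worst.__getitem__)
    return a_star, worst[a_star]


# Three actions against two possible behaviors of the other agents.
action, regret = minimax_regret_action([[4, 0], [2, 2], [0, 3]])
```

In this toy table the middle action never achieves the best payoff, yet it is selected because its largest possible loss relative to the best response is the smallest.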

2.
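Against the background of online information dissemination, this paper abstracts and analyzes the static structure and dynamic behavior of the population that spreads information online. Building on earlier work that used classical game theory to analyze the behavior of individual spreaders, it establishes an evolutionary game model matching the characteristics of online information dissemination to describe the interactions within the population. Boundedly rational agents simulate information spreaders in a real online environment, and evolutionarily stable strategies and replicator dynamics are used to analyze the static and dynamic equilibria of a single spreading population. Computation verifies that the stable structure of the population is strongly correlated with the dynamic equilibrium of group behavior.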
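As a rough illustration of the replicator dynamics mentioned in this abstract, the sketch below integrates the standard replicator equation with a simple Euler step. The payoff matrix is an assumed Hawk-Dove game chosen only because its evolutionarily stable state is easy to check; it is not the payoff structure used in the paper, and the function names are hypothetical.

```python
def replicator_step(x, A, dt=0.01):
    """One Euler step of dx_i/dt = x_i * (f_i - average fitness)."""
    n = len(x)
    # Fitness of each strategy against the current population mix.
    fitness = [sum(A[i][j] * x[j] for j in range(n)) for i in range(n)]
    avg = sum(x[i] * fitness[i] for i in range(n))
    return [x[i] + dt * x[i] * (fitness[i] - avg) for i in range(n)]


def simulate(x, A, steps=5000, dt=0.01):
    """Iterate the replicator step from the initial shares x."""
    for _ in range(steps):
        x = replicator_step(x, A, dt)
    return x


# Assumed Hawk-Dove payoffs with V = 2 and C = 4 (illustrative only).
hawk_dove = [[-1.0, 2.0], [0.0, 1.0]]
final = simulate([0.1, 0.9], hawk_dove)
```

For these assumed payoffs the population share of the first strategy settles near V/C = 0.5, which is the evolutionarily stable state of that game.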

3.
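Modeling information diffusion in microblog networks based on behavior prediction   Total citations: 1, self-citations: 0, citations by others: 1
Studying the mechanisms of information propagation and diffusion in microblog networks is important for marketing, public opinion management, and similar applications. Most current propagation models ignore individual differences among users. To address this, four categories of features that influence retweeting behavior are extracted, a logistic regression model from machine learning is used to analyze and predict individual retweeting behavior, and, on that basis, individual user differences are incorporated to build an information propagation model based on behavior prediction. Experiments show that the model simulates information propagation in real networks well.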
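The logistic-regression step named in this abstract might look roughly like the sketch below. The abstract does not say what the four feature categories are, so the four numbers per sample here are placeholder values, and the training loop is plain stochastic gradient descent on the log-loss rather than whatever fitting procedure the authors used.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, features):
    """Estimated probability that a user retweets, given a feature vector."""
    return sigmoid(bias + sum(w * f for w, f in zip(weights, features)))


def train(samples, labels, lr=0.5, epochs=500):
    """Fit weights by stochastic gradient descent on the log-loss."""
    weights, bias = [0.0] * len(samples[0]), 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            err = predict(weights, bias, x) - y  # gradient w.r.t. the logit
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
            bias -= lr * err
    return weights, bias


# Placeholder data: four hypothetical feature values per (user, message) pair.
X = [[1.0, 1.0, 1.0, 1.0], [0.9, 0.8, 1.0, 0.7],
     [0.0, 0.0, 0.0, 0.0], [0.1, 0.2, 0.0, 0.1]]
y = [1, 1, 0, 0]
w, b = train(X, y)
```

A per-user retweet probability of this kind can then drive a diffusion simulation, which is the role the abstract assigns to the prediction model.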

4.
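In information diffusion, a user who receives the same message repeatedly shows a certain tendency in retweeting. Modeling this tendency is a key problem in research areas related to information diffusion, such as influence analysis, propagation dynamics, and social recommendation. This paper assumes that a user's retweet choice is determined mainly by interpersonal influence between users, and that the strength of interpersonal influence results jointly from the influence of the spreader and the susceptibility of the receiver. The users' latent influence and susceptibility are inferred from real diffusion records, and a retweet choice model is then proposed. The model addresses two shortcomings of existing methods: inadequate modeling of retweet choice behavior and poor generalization. Typical retweet choice modeling methods are selected as baselines, and the proposed model is compared with them on Sina Weibo data. Experiments show that the proposed model performs better on both evaluation metrics, demonstrating its effectiveness.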

5.
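Multi-agent cooperative pursuit is a typical problem in research on multi-agent coordination and cooperation. For the pursuit of a single evader with learning ability, a multi-agent cooperative pursuit algorithm based on game theory and Q-learning is proposed. First, a cooperative pursuit team is formed and a game model of cooperative pursuit is constructed. Second, by learning the evader's strategy choices, the evader's trajectory under a finite Step-T cumulative reward is built, and the trajectory is added to the pursuers' strategy set. Finally, the cooperative pursuit game is solved to obtain a Nash equilibrium, and each agent executes its equilibrium strategy to complete the pursuit. Because solving the game may yield multiple equilibria, a fictitious-play action selection algorithm is added to select the optimal equilibrium strategy. Simulation experiments in C# show that the proposed algorithm effectively solves the pursuit of a single learning-capable evader in an environment with obstacles, and comparative analysis of the experimental data shows that, under the same conditions, its pursuit efficiency is better than that of pursuit algorithms based purely on game theory or purely on learning.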

6.
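Nodes in Ad Hoc networks are prone to selfish behavior when forwarding data. To motivate selfish nodes to take part in data forwarding, a node incentive strategy, IMTFT, is proposed. A node forwarding game model is built on Bayesian game theory, and an improved TFT strategy with an added incentive factor is introduced into the model to incentivize selfish nodes in a balanced way. The Nash equilibrium conditions of nodes under the IMTFT strategy are derived and analyzed, and the optimal values of the parameters related to the incentive factor are determined. Simulation results show that the strategy effectively motivates selfish nodes to take part in data forwarding and improves overall network performance.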
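Assuming TFT here denotes the classic tit-for-tat strategy, a repeated forwarding game between two nodes can be sketched as below. This shows only the unmodified baseline: the incentive factor that distinguishes IMTFT is not specified in the abstract and is deliberately left out. All function names are hypothetical.

```python
def tit_for_tat(opponent_history):
    """Forward on the first round, then mirror the opponent's last move."""
    return True if not opponent_history else opponent_history[-1]


def always_drop(opponent_history):
    """A fully selfish node that never forwards."""
    return False


def play(strategy_a, strategy_b, rounds):
    """Run a repeated forwarding game; True means the node forwarded."""
    hist_a, hist_b = [], []
    for _ in range(rounds):
        # Each node sees only the other node's past moves.
        move_a = strategy_a(hist_b)
        move_b = strategy_b(hist_a)
        hist_a.append(move_a)
        hist_b.append(move_b)
    return hist_a, hist_b


cooperative = play(tit_for_tat, tit_for_tat, 3)
punished = play(tit_for_tat, always_drop, 4)
```

Two tit-for-tat nodes forward for each other indefinitely, while a node that always drops packets is served only once before being refused.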

7.
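Building on a study of the Q-Learning algorithm, team cooperation theory from game theory is introduced into reinforcement learning, and a multi-agent learning algorithm based on joint games is proposed. The algorithm sets up multiple stage games and evaluates their outcomes using the payoff matrix, providing an effective decision strategy for agent behavior: each agent predicts the action it should execute from the optimal equilibrium solution, or by observing the historical actions of cooperating agents together with its own current situation. Simulation experiments on a task scheduling problem verify the convergence of the algorithm.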
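For reference, the single-agent tabular Q-Learning update that this work builds on can be written as the short sketch below. It is the textbook rule only; the joint-game extension proposed in the paper is not reproduced, and the state and action labels are placeholders.

```python
from collections import defaultdict


def q_update(Q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """Apply Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(Q[(next_state, a)] for a in actions)
    Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
    return Q[(state, action)]


# Placeholder states and actions; unseen entries default to 0.0.
Q = defaultdict(float)
actions = ["a", "b"]
first = q_update(Q, "s0", "a", 1.0, "s1", actions)
Q[("s1", "b")] = 2.0
second = q_update(Q, "s0", "a", 1.0, "s1", actions)
```

The second update is larger than the first because the successor state now has a valuable action, which is how reward information propagates backward through the table.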

8.
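To improve the reliability and energy efficiency of data forwarding in wireless sensor networks, this paper builds an auction routing game model based on auction games and proposes a price routing game algorithm for selecting forwarding nodes. In the algorithm, candidate forwarding nodes compete with one another to earn virtual currency from the sending node, and the sending node selects the best forwarding node according to the prices the candidates quote. Simulations show that the auction routing game model is reasonable and effective, and that the proposed price routing game algorithm reduces node energy consumption and extends network lifetime.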

9.
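傅伟, 周新力. Computer Engineering (《计算机工程》), 2019, 45(8): 146-151
Cooperative operations by mixed formations of unmanned aerial vehicles (UAVs) and manned aircraft can improve combat effectiveness, but they require stable data links between UAVs and manned aircraft so that the battlefield situation can be transmitted quickly to the manned aircraft. To this end, a game-theoretic single-price multi-attribute bidding model is established: using tendering and bidding among neighbor nodes within communication range, the selection of a transmission node is abstracted as a tendering model. Scoring functions and payoff models for the tendering and bidding nodes are designed in terms of node energy, link stability, and forwarding angle. The best node is finally chosen by comparing the quality attributes and bid prices of multiple bidding nodes, and it forwards the tendering node's data. Game analysis proves that the data forwarding algorithm is incentive compatible, which ensures that nodes take part in data forwarding with truthful bids. Simulation results show that, compared with the GPSR and AMIM algorithms, the algorithm increases network lifetime while keeping network energy consumption balanced and links stable, and that it suits the communication environment of cooperative operations by mixed UAV and manned-aircraft formations.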

10.
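Retweeting by users is an important channel of information diffusion in social networks, and studying retweeting patterns and diffusion laws helps in monitoring and suppressing the spread of topics online. Existing models often lack timeliness and struggle to characterize user behavior accurately. This work therefore analyzes the behavior patterns of social network users and improves the calculation of retweet probability based on the connection strength between users and the influence of neighbor nodes. It then introduces nodes with online and offline states into the classic epidemic-dynamics SCIR model and controls network activity through the ratio of online users. Simulation results show that, compared with the traditional SCIR model, the model is more stable during diffusion and reaches higher coverage, that the trends in node state changes are closer to those of real networks, and that it simulates the spread of hot topics in social networks well.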

11.
This paper describes an approach to the design of interactive multimedia materials being developed in a European Community project. The developmental process is seen as a dialogue between technologists and teachers. This dialogue is often problematic because of the differences in training, experience and culture between them. Conditions needed for fruitful dialogue are described and the generic model for learning design used in the project is explained.

12.
European Community policy and the market   Total citations: 1, self-citations: 0, citations by others: 1
This paper starts with some reflections on the policy considerations and priorities which are shaping European Commission (EC) research programmes. Then it attempts to position the current projects which seek to capitalise on information and communications technologies for learning in relation to these priorities and the apparent realities of the marketplace. It concludes that while there are grounds to be optimistic about the contribution EC programmes can make to the efficiency and standard of education and training, they are still too technology driven.

13.
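Fusion-based ensemble methods are widely used in pattern recognition, but some base classifiers have unstable real-time performance, which degrades the performance of multi-classifier fusion. To address this problem, this paper proposes a new sub-fusion ensemble classifier system based on multiple classifiers. Working above the measurement-level fusion layer, the method dynamically selects among the base classifiers of each type and takes the class with the most votes as the class the fusion system assigns to a feature vector, forming a new adaptive sub-fusion ensemble classifier method. Experiments show that the method achieves clearly higher recognition accuracy than traditional classifiers and classifier fusion methods, and is more robust.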
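The final voting step described in this abstract, where the class with the most votes wins, can be sketched in a few lines. The dynamic selection of base classifiers that precedes the vote is not specified in the abstract and is not modeled here; the labels are placeholders and `majority_vote` is a hypothetical name.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the class label predicted by the largest number of classifiers."""
    counts = Counter(predictions)
    # most_common(1) yields a one-element list of (label, count).
    return counts.most_common(1)[0][0]


# Placeholder outputs of five base classifiers for one feature vector.
fused = majority_vote(["cat", "dog", "cat", "bird", "cat"])
```

In a selection-then-vote scheme, the list passed to the function would contain only the outputs of the classifiers chosen for the current sample.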

14.
Development of software intensive systems (systems) in practice involves a series of self-contained phases for the lifecycle of a system. Semantic and temporal gaps, which occur among phases and among developer disciplines within and across phases, hinder the ongoing development of a system because of the interdependencies among phases and among disciplines. Such gaps are magnified among systems that are developed at different times by different development teams, which may limit reuse of artifacts of systems development and interoperability among the systems. This article discusses such gaps and a systems development process for avoiding them.

15.
This paper presents control chart models and the necessary simulation software for locating the economic values of the control parameters. The simulation program is written in FORTRAN, requires only 10K of main storage, and can run on most mini and micro computers. Two models are presented: one describes the process when it is operating at full capacity and the other when the process is operating under capacity. The models allow the product quality to deteriorate to a further level before an existing out-of-control state is detected, and they can also be used in situations where no prior knowledge exists of the out-of-control causes and the resulting proportion defectives.

16.
Going through a few examples of robot artists who are recognized worldwide, we try to analyze the deepest meaning of what is called “robot art” and the related art field definition. We also try to highlight its well-marked borders, such as kinetic sculptures, kinetic art, cyber art, and cyberpunk. A brief excursion into the importance of the context, the message, and its semiotics is also provided, case by case, together with a few hints on the history of this discipline in the light of an artistic perspective. Therefore, the aim of this article is to try to summarize the main characteristics that might classify robot art as a unique and innovative discipline, and to track down some of the principles by which a robotic artifact can or cannot be considered an art piece in terms of social, cultural, and strictly artistic interest. This work was presented in part at the 13th International Symposium on Artificial Life and Robotics, Oita, Japan, January 31–February 2, 2008.

17.
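To design an underground coal mine safety substation that is low-cost, low-power, easy to operate, powerful, and highly reliable, and in view of actual coal mine safety production, this paper proposes a design for an underground coal mine safety monitoring substation built around an MCS-51 series microcontroller with a CAN bus communication interface. It first presents the overall architecture of the substation, then details the design of the analog input signal processing system, and finally describes the design of the minimum microcontroller system and its keyboard, display, alarm, and communication components. To verify the feasibility and effectiveness of the design, it is simulated in Proteus. The substation provides over-limit alarms for analog parameters such as gas concentration and temperature, and status indication for switching quantities such as motor start/stop and air door open/closed. The simulation results show that the designed substation has practical application value.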

18.
Computer Science (《计算机科学》), 2007, 34(4): 148-148
Recent years have seen rapid advances in various grid-related technologies, middleware, and applications. The GCC conference has become one of the largest scientific events worldwide in grid and cooperative computing. The 6th International Conference on Grid and Cooperative Computing (GCC2007), sponsored by the China Computer Federation (CCF), the Institute of Computing Technology, Chinese Academy of Sciences (ICT), and Xinjiang University, in cooperation with the IEEE Computer Society, is to be held from August 16 to 18, 2007 in Urumchi, Xinjiang, China.

19.
Although there are many arguments that logic is an appropriate tool for artificial intelligence, there has been a perceived problem with the monotonicity of classical logic. This paper elaborates on the idea that reasoning should be viewed as theory formation where logic tells us the consequences of our assumptions. The two activities of predicting what is expected to be true and explaining observations are considered in a simple theory formation framework. Properties of each activity are discussed, along with a number of proposals as to what should be predicted or accepted as reasonable explanations. An architecture is proposed to combine explanation and prediction into one coherent framework. Algorithms used to implement the system as well as examples from a running implementation are given.

20.
The 34th Chinese Control Conference and SICE Annual Conference 2015 (CCCSICE2015) is organized by the Technical Committee on Control Theory (TCCT) of the Chinese Association of Automation (CAA) and the Society of Instrument and Control Engineers (SICE) of Japan, and locally organized by Hangzhou Dianzi University (HDU).
