具有未知动态的线性系统二人零和博弈问题在线学习方案 Online solution of two-player zero-sum games for linear systems with unknown dynamics期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

具有未知动态的线性系统二人零和博弈问题在线学习方案

引用本文：	富月,柴天佑.具有未知动态的线性系统二人零和博弈问题在线学习方案[J].控制理论与应用,2015,32(2):196-201.

作者姓名：	富月柴天佑

作者单位：	东北大学流程工业综合自动化国家重点实验室,辽宁沈阳,110819

基金项目：	国家自然科学基金项目(61374042), 中央高校基本科研业务费基金项目(N130408003, N130108001)资助.

摘要：	针对具有未知动态线性系统的二人零和博弈问题,本文提出了一种新的基于单环迭代方法的在线学习方案.为保证单环迭代方法的收敛性,给出了一种新的分析方法.在系统内部矩阵A,控制输入矩阵B以及干扰输入矩阵D均未知的情况下,通过在线迭代策略,同步得到了博弈代数黎卡提方程的近似解,以及控制和干扰策略.仿真结果表明了所提方法的有效性.
关键词：	二人零和博弈策略迭代博弈代数黎卡提方程
收稿时间：	1/9/2014 12:00:00 AM
修稿时间：	2014/10/18 0:00:00
Online solution of two-player zero-sum games for linear systems with unknown dynamics

FU Yue and CHAI Tian-you.Online solution of two-player zero-sum games for linear systems with unknown dynamics[J].Control Theory & Applications,2015,32(2):196-201.

Authors:	FU Yue and CHAI Tian-you

Affiliation:	State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University and State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University

Abstract:	For two-player zero-sum games of continuous-time linear systems with unknown dynamics, we present an online adaptive learning algorithm based on the policy iteration (PI) scheme with only one loop. A new analytical method to prove the convergence of the PI scheme is presented. An approximate solution to the generalized game algebraic Riccati equation without using a priori knowledge of the system matrices is developed. Simulation results illustrate the effectiveness of the proposed method.

Keywords:	two-player zero-sum game policy iterations game algebraic Riccati equation
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《控制理论与应用》浏览原始摘要信息
	点击此处可从《控制理论与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏