首页 | 本学科首页   官方微博 | 高级检索  
     

具有加性和乘性噪声的线性离散时间随机系统的无模型最优跟踪控制
引用本文:银玉博,罗世贤,万韬. 具有加性和乘性噪声的线性离散时间随机系统的无模型最优跟踪控制[J]. 控制理论与应用, 2023, 40(6): 1014-1022
作者姓名:银玉博  罗世贤  万韬
作者单位:广西大学,广西大学,华南理工大学
基金项目:国家自然科学基金项目(62003104), 广西壮族自治区自然科学基金项目(2022GXNSFBA035649), 广西大学学科交叉科研项目(2022JCC019), 广西 科技计划项目(桂科AD23026217)
摘    要:本文研究一类同时受加性和乘性噪声影响的离散时间随机系统的最优跟踪控制问题.通过构造由原始系统和参考轨迹组成的增广系统,将随机线性二次跟踪控制(SLQT)的成本函数转化为与增广状态相关的二次型函数,由此推导出用于求解SLQT的贝尔曼方程和增广随机代数黎卡提方程(SARE),而后进一步针对系统和参考轨迹动力学信息完全未知的情形,提出一种Q-学习算法来在线求解增广SARE,证明了该算法的收敛性,并采用批处理最小二乘法(BLS)解决该在线无模型控制算法的实现问题.通过对单相电压源UPS逆变器的仿真,验证了所提出控制方案的有效性.

关 键 词:随机线性二次跟踪  随机噪声  无模型控制  Q-学习
收稿时间:2021-11-13
修稿时间:2023-06-27

Model-free optimal tracking control for linear discrete-time stochastic systems subject to additive and multiplicative noises
YIN Yu-bo,LUO Shi-xian and WAN Tao. Model-free optimal tracking control for linear discrete-time stochastic systems subject to additive and multiplicative noises[J]. Control Theory & Applications, 2023, 40(6): 1014-1022
Authors:YIN Yu-bo  LUO Shi-xian  WAN Tao
Affiliation:Guangxi University,Guangxi University,South China University of Technology
Abstract:This paper focuses on the optimal tracking control problem for a class of discrete-time stochastic systems with additive and multiplicative noises. By constructing an augmented system composed of the original system and the reference generator, the cost function of the stochastic linear quadratic tracking control (SLQT) is transformed into a quadratic function in terms of the augmented state. A Bellman equation and an augmented stochastic algebraic Riccati equation (SARE) for solving the SLQT are then derived. For the case of both the system and reference generator dynamics are completely unknown, a Q-learning algorithm is proposed to solve the augmented SARE, and its convergence is proved rigorously. Moreover, the online model-free control algorithm implements with the batch least square method (BLS). The effectiveness of the proposed control scheme is verified by the single-phase voltage-source UPS inverter.
Keywords:stochastic linear quadratic tracking   stochastic noise   model-free control   Q-learning
点击此处可从《控制理论与应用》浏览原始摘要信息
点击此处可从《控制理论与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号