基于强化学习的一类具有输入约束非线性系统最优控制 Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于强化学习的一类具有输入约束非线性系统最优控制

引用本文：	罗傲,肖文彬,周琪,鲁仁全.基于强化学习的一类具有输入约束非线性系统最优控制[J].控制理论与应用,2022,39(1):154-164.

作者姓名：	罗傲肖文彬周琪鲁仁全

作者单位：	广东工业大学广东省智能决策与协同控制重点实验室,广东广州510006

基金项目：	国家自然科学基金项目(62121004, 61973091),“广东特支计划”本土创新创业团队项目(2019BT02X353), 广东省重点领域研发计划项目 (2021B0101410005)资助.

摘要：	针对部分系统存在输入约束和不可测状态的最优控制问题,本文将强化学习中基于执行–评价结构的近似最优算法与反步法相结合,提出了一种最优跟踪控制策略.首先,利用神经网络构造非线性观测器估计系统的不可测状态.然后,设计一种非二次型效用函数解决系统的输入约束问题.相比现有的最优方法,本文提出的最优跟踪控制方法不仅具有反步法在处理...
关键词：	输入约束不可测状态最优控制强化学习反步法
收稿时间：	2020/12/14 0:00:00
修稿时间：	2021/6/25 0:00:00
Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning

LUO Ao,XIAO Wen-bin,ZHOU Qi and LU Ren-quan.Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning[J].Control Theory & Applications,2022,39(1):154-164.

Authors:	LUO Ao XIAO Wen-bin ZHOU Qi and LU Ren-quan

Affiliation:	Guangdong University of Technology,Guangdong University of Technology,Guangdong University of Technology,Guangdong University of Technology

Abstract:	In this paper, by incorporating the approximate optimization algorithm, which is derived from actor-critic structure in reinforcement learning, into the backstepping, an optimal tracking control strategy is proposed for a class of nonlinear systems with immeasurable states and input constraints. First, a nonlinear observer is constructed with neural network to estimate the immeasurable states. Then, a non-quadratic cost function is designed to solve the problem of controller constraints. Compared with the existing optimization methods, the optimal tracking control method proposed in this paper not only has the advantage of backstepping technique in addressing the n-order system tracking problem, but also ensures that all virtual controllers are optimal. And this method simplifies the controller design. Finally, according to Lyapunov stability theory, it is proven that all signals in the closed-loop system are uniformly ultimately bounded. The effectiveness of the proposed method is verified by the simulation results.

Keywords:	input constraints immeasurable states optimal control reinforcement learning backstepping
本文献已被维普万方数据等数据库收录！
	点击此处可从《控制理论与应用》浏览原始摘要信息
	点击此处可从《控制理论与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏