Similar literature
1.
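Mao Yanling (毛艳岭), Fu Yue (富月). Acta Automatica Sinica, 2023, 49(10): 2122-2135
For continuous-time nonlinear systems with unknown dynamics and M equilibrium points, an adaptive optimal switching control method is proposed that combines a linear adaptive optimal switching controller with an unmodeled-dynamics compensator, building on the embedding transformation technique and approximate dynamic programming. First, M linearized models are established at the M equilibrium points of the nonlinear system. When the model parameters are known, a control system structure is proposed that consists of a linear optimal switching controller, a switching criterion, an unmodeled-dynamics compensator, and the nonlinear system. When the model parameters are unknown, input and state data are collected near each equilibrium point, and the adaptive optimal switching controller and the optimal switching sequence are obtained using the iterative solution formula of the Riccati equation, the least-squares method, the minimum principle, and quadratic programming. Finally, simulation experiments verify the effectiveness, superiority, and practical applicability of the proposed method.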
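The abstract above mentions an iterative solution formula for the Riccati equation. As a rough illustration only, the sketch below runs the classical model-based Kleinman iteration on a scalar system dx/dt = a*x + b*u with cost integrand q*x^2 + r*u^2. The function name `kleinman_scalar` and all numeric values are assumptions made for this example; the paper's own method is data-driven (least squares on collected input and state data), which this sketch does not reproduce.

```python
def kleinman_scalar(a, b, q, r, k0, iterations=10):
    """Kleinman iteration for the scalar Riccati equation
    2*a*p - (b*p)**2 / r + q = 0 (hypothetical helper for illustration)."""
    if a - b * k0 >= 0:
        raise ValueError("k0 must be stabilizing: a - b*k0 < 0")
    k = k0
    history = []
    for _ in range(iterations):
        closed_loop = a - b * k
        # Policy evaluation: solve the scalar Lyapunov equation
        # 2*(a - b*k)*p + q + r*k**2 = 0 for p.
        p = (q + r * k * k) / (-2.0 * closed_loop)
        # Policy improvement: new feedback gain from the evaluated cost.
        k = b * p / r
        history.append(p)
    return p, k, history
```

Starting from any stabilizing gain, for example `kleinman_scalar(1.0, 1.0, 1.0, 1.0, k0=2.0)`, the sequence of `p` values should approach the stabilizing Riccati solution, which for these values is 1 + sqrt(2).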

2.
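To address the mismatch between the model and the real system, an algorithm that integrates dynamic system optimization and parameter estimation is proposed, based on a time-varying linear quadratic problem. The algorithm can approach the optimal solution of the real problem. A sufficient condition for its convergence is given and its optimality is analyzed. A simulation example illustrates the effectiveness and practicality of the algorithm.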

3.
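Optimal adaptive control for a class of nonlinear uncertain systems   Cited by: 2 (self-citations: 1, citations by others: 1)
This paper studies feedback stabilization of a class of nonlinear systems that are subject to disturbances and contain unknown parameters in both the state and control terms. In the controller design, the adaptive stabilization problem of the original system is converted into a non-adaptive stabilization problem of an extended system, and a robust control Lyapunov function of the extended system is used to design a control law that adaptively stabilizes the original system. Furthermore, using the inverse optimal approach, it is proved that this control law is also optimal with respect to a certain performance index.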

4.
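To overcome the limitations of symmetric full-state constraints and frequent control policy updates while optimizing an infinite-horizon cost function, an event-triggered integral reinforcement learning controller design method with state constraints is proposed for a class of continuous-time affine nonlinear systems with partially unknown dynamics. The method is a data-based online policy iteration method. A system transformation is introduced to convert the system with full-state constraints into an unconstrained system. Based on the event-triggering mechanism and the integral reinforcement learning algorithm, system transformation, policy evaluation, and policy improvement are executed alternately. With the full-state constraints satisfied, the cost function and the control policy converge to their optimal values, and the update frequency of the control policy is reduced. In addition, a Lyapunov function is constructed to rigorously analyze the stability of the system and of the critic neural network weight error. A simulation of a single-link robot arm further illustrates the feasibility of the algorithm.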
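The event-triggering idea in this abstract (update the control only when needed) can be illustrated in isolation. The sketch below is a toy under stated assumptions, not the paper's algorithm: it uses a scalar discrete-time linear plant, a fixed feedback gain, and a relative-error trigger |x - x_held| > sigma*|x|. The function name `simulate_event_triggered` and all numeric values are hypothetical, and the integral reinforcement learning and neural network parts are omitted.

```python
def simulate_event_triggered(steps=400, sigma=0.2, a=1.02, b=0.1, k=0.5, x0=1.0):
    """Simulate x[t+1] = a*x[t] + b*u[t] with u[t] = -k*x_held, where x_held
    is refreshed only when the relative sampling error exceeds sigma."""
    x = x0
    x_held = None          # last state sample sent to the controller
    updates = 0            # number of control updates (trigger events)
    for _ in range(steps):
        # Trigger rule: resample when the held state is too stale.
        if x_held is None or abs(x - x_held) > sigma * abs(x):
            x_held = x
            updates += 1
        u = -k * x_held    # control is held constant between events
        x = a * x + b * u
    return x, updates
```

With these illustrative values the open-loop plant is unstable (a > 1), yet the state still decays while the control is refreshed on only a fraction of the steps, which is the resource saving that event-triggered schemes aim for.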

5.
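Zhang Shaojie (张绍杰), Wu Xue (吴雪), Liu Chunsheng (刘春生). Acta Automatica Sinica, 2018, 44(12): 2188-2197
This paper proposes an optimal adaptive output tracking control scheme for a class of multi-input multi-output (MIMO) uncertain continuous-time affine nonlinear systems with actuator faults. A weight tuning algorithm is designed for the neural network that estimates the uncertain terms so that system stability is guaranteed, and, using only a critic network, the infinite-horizon cost function and the optimal control input satisfying the Hamilton-Jacobi-Bellman (HJB) equation are obtained simultaneously. Considering actuator stuck faults and partial loss-of-effectiveness faults, an optimal adaptive compensation control law is designed that achieves uniformly ultimately bounded tracking of the reference output. Flight control simulations and comparisons demonstrate the effectiveness and superiority of the proposed method.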

6.
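A new adaptive dynamic optimization method (ADP) is applied to achieve online optimal control of continuous-time nonlinear systems. First, the Hamilton-Jacobi-Bellman (HJB) equation is used to derive the optimal control of the system, and a neural network trained with the BP algorithm estimates the performance index in the HJB equation, which yields the optimal control of the continuous-time nonlinear system. A new adaptive algorithm driven by the parameter error is also introduced to solve the dynamic optimization problem online, and the convergence of the parameters is analyzed in detail using the Lyapunov method. Finally, simulation results verify the feasibility of the proposed method.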

7.
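On adaptive fuzzy control of multivariable nonlinear systems   Cited by: 14 (self-citations: 1, citations by others: 14)
Combining fuzzy logic systems, adaptive control, and H∞ control, a new control strategy is proposed for a class of unknown nonlinear multivariable systems, and a stability analysis of the control algorithm is given. Simulation results demonstrate the effectiveness of the proposed control algorithm.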

8.
In this paper, an asymmetric bipartite consensus problem for nonlinear multi-agent systems with cooperative and antagonistic interactions is studied under an event-triggered mechanism. For agents described by a structurally balanced signed digraph, the asymmetric bipartite consensus objective is first defined, assigning the agents' outputs different signs and modulus values. Considering the completely unknown dynamics of the agents, a novel event-triggered model-free adaptive bipartite control protocol is designed based on the agents' triggered outputs and an equivalent compact form data model. By utilizing the Lyapunov analysis method, the threshold of the triggering condition is obtained. Subsequently, the asymptotic convergence of the tracking error is deduced and a sufficient condition is obtained based on the contraction mapping principle. Finally, a simulation example further demonstrates the effectiveness of the protocol.

9.
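DISOPE algorithm for optimal control of discrete-time nonlinear time-delay systems   Cited by: 4 (self-citations: 1, citations by others: 4)
For the optimal control of nonlinear time-delay systems, an iterative algorithm is proposed that is based on a linear time-delay model and a quadratic performance index problem. The time-delay system is converted into an augmented-state system that satisfies the Markov property. When the model differs from the real system, the algorithm obtains the optimal solution of the original problem by iteratively solving a time-delay linear optimal control problem and a parameter estimation problem. A simulation example shows the effectiveness and practicality of the algorithm.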
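The state-augmentation step mentioned above can be sketched for a scalar linear model x[k+1] = a*x[k] + a_d*x[k-d] + b*u[k]. Stacking the current state with its d delayed copies gives a delay-free (Markov) system z[k+1] = A z[k] + B u[k]. The helper names `augment` and `step` and the numbers used are assumptions for this example; the paper's DISOPE iteration itself is not shown.

```python
def augment(a, a_d, b, d):
    """Build A, B for z[k] = [x[k], x[k-1], ..., x[k-d]] (d >= 1)."""
    n = d + 1
    A = [[0.0] * n for _ in range(n)]
    A[0][0] = a            # current-state term
    A[0][d] += a_d         # delayed-state term reads the oldest entry
    for i in range(1, n):
        A[i][i - 1] = 1.0  # shift register: each entry ages by one step
    B = [b] + [0.0] * d
    return A, B

def step(A, B, z, u):
    """One step of the augmented system z[k+1] = A z[k] + B u[k]."""
    return [sum(A[i][j] * z[j] for j in range(len(z))) + B[i] * u
            for i in range(len(z))]
```

The first component of `z` then follows the same trajectory as the original delayed recursion, so standard delay-free linear quadratic tools can be applied to the augmented model.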

10.
This paper presents a new model-reference adaptive control method based on a bi-objective optimal control formulation for systems with input uncertainty. A parallel predictor model is constructed to relate the predictor error to the estimation error of the control effectiveness matrix. In this work, we develop an optimal control modification adaptive control approach that seeks to minimize a bi-objective linear quadratic cost function of both the tracking error norm and the predictor error norm simultaneously. The resulting adaptive laws for the parametric uncertainty and control effectiveness uncertainty are dependent on both the tracking error and the predictor error, while the adaptive laws for the feedback gain and command feedforward gain are only dependent on the tracking error. The optimal control modification term provides robustness to the adaptive laws naturally from the optimal control framework. Simulations demonstrate the effectiveness of the proposed adaptive control approach.

11.
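A learning adaptive control scheme is presented for a class of discrete-time nonlinear systems that does not depend on a mathematical model of the controlled system. It requires no structural information, mathematical model, external test signals, or training process, and is designed using only the I/O data of the controlled system, so unmodeled dynamics in the traditional sense do not arise. Computer simulation results illustrate the correctness and effectiveness of the scheme.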

12.
Dual-stage optimal iterative learning control for nonlinear non-affine discrete-time systems   Cited by: 3 (self-citations: 0, citations by others: 3)
On the basis of a new dynamic linearization technique along the iteration axis, a dual-stage optimal iterative learning control is presented for nonlinear and non-affine discrete-time systems. Dual-stage indicates that two optimal learning stages are designed to iteratively improve the control input sequence and the learning gain, respectively. The main feature is that the controller design and convergence analysis depend only on the I/O data of the dynamical system. In other words, the control parameters can be selected easily without any other knowledge of the system. A simulation study illustrates the geometric convergence of the presented method along the iteration axis, in which an example of freeway traffic iterative learning control is noteworthy for its intrinsic engineering importance.
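A heavily simplified sketch of a data-driven iterative learning update is given below. It is a single-stage scheme, not the paper's dual-stage algorithm: a slope estimate `phi[t]` along the iteration axis is refreshed from the last change in input and output by a projection-type rule, and the input is corrected with a gain built from that estimate. The plant (a memoryless non-affine map u + 0.5*sin(u)), the function names, and the tuning values rho, lam, eta, mu are all assumptions chosen for this example.

```python
import math

def plant(u):
    """Toy non-affine input-output map; unknown to the learning law."""
    return u + 0.5 * math.sin(u)

def run_ilc(reference, iterations=60, rho=1.0, lam=1.0, eta=1.0, mu=1.0):
    n = len(reference)
    u_prev = [0.0] * n
    y_prev = [plant(u) for u in u_prev]
    du_prev = [0.0] * n          # last input change per time step
    dy_prev = [0.0] * n          # last output change per time step
    phi = [1.0] * n              # iteration-axis slope estimates
    max_errors = []
    for _ in range(iterations):
        u = []
        for t in range(n):
            du, dy = du_prev[t], dy_prev[t]
            # Projection-type estimate update using only I/O data.
            phi[t] += eta * du / (mu + du * du) * (dy - phi[t] * du)
            error = reference[t] - y_prev[t]
            # Learning update with a gain derived from the estimate.
            u.append(u_prev[t] + rho * phi[t] / (lam + phi[t] ** 2) * error)
        y = [plant(v) for v in u]
        du_prev = [new - old for new, old in zip(u, u_prev)]
        dy_prev = [new - old for new, old in zip(y, y_prev)]
        max_errors.append(max(abs(r - v) for r, v in zip(reference, y)))
        u_prev, y_prev = u, y
    return u_prev, max_errors
```

For this toy map the true slope lies in [0.5, 1.5], so with the values above each iteration shrinks the tracking error by a fixed factor, which mirrors the geometric convergence along the iteration axis that the abstract reports for its own method.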

13.
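This paper studies output-feedback adaptive fuzzy event-triggered fault-tolerant control for a class of switched nonlinear systems with actuator faults under an average dwell time constraint. First, a mode-dependent state observer is built to estimate the unmeasurable states, and fuzzy logic systems are used to approximate the unknown terms. Second, an adaptive fuzzy fault-tolerant event-triggered control scheme is constructed that saves network resources and reduces data transmission. Then, by constructing multiple Lyapunov functions and using the average dwell time method, it is proved that all states of the closed-loop system are semi-globally uniformly ultimately bounded and that Zeno behavior is excluded. Finally, numerical simulation verifies the effectiveness of the method.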

14.
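This paper studies output-feedback adaptive fuzzy event-triggered fault-tolerant control for a class of switched nonlinear systems with actuator faults under an average dwell time constraint. First, a mode-dependent state observer is built to estimate the unmeasurable states, and fuzzy logic systems are used to approximate the unknown terms. Second, an adaptive fuzzy fault-tolerant event-triggered control scheme is constructed that saves network resources and reduces data transmission. Then, by constructing multiple Lyapunov functions and using the average dwell time method, it is proved that all states of the closed-loop system are semi-globally uniformly ultimately bounded and that Zeno behavior is excluded. Finally, numerical simulation verifies the effectiveness of the method.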

15.
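Neural network robust adaptive tracking control of nonlinear systems   Cited by: 1 (self-citations: 0, citations by others: 1)
For a class of second-order nonlinear systems with unknown nonlinear functions and unknown nonlinear virtual-coefficient functions, a neural network robust adaptive output tracking control method is proposed. Lyapunov stability analysis proves that the proposed neural network adaptive controller keeps all signals in the controlled system bounded. The chosen neural network weight tuning law prevents parameter drift in the adaptive control.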

16.
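Dual-stage optimal iterative learning control for nonlinear non-affine discrete-time systems   Cited by: 1 (self-citations: 0, citations by others: 1)
Chi Ronghu (池荣虎), Hou Zhongsheng (侯忠生). Acta Automatica Sinica, 2007, 33(10): 1061-1065
For non-affine nonlinear discrete-time systems, a dual-stage optimal iterative learning control algorithm is proposed, based on a new dynamic linearization technique along the iteration axis. Dual-stage means that two optimal learning stages are designed to iteratively improve the control input sequence and the learning gain, respectively. The main feature is that the controller design and convergence analysis depend only on the I/O data of the dynamical system. In other words, the controller parameters can be selected easily without any other information about the system. A simulation study shows that the proposed algorithm converges geometrically along the iteration axis, a property of significant engineering importance in freeway traffic iterative learning control.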

17.
Variable Resolution Discretization in Optimal Control   Cited by: 3 (self-citations: 0, citations by others: 3)
Rémi Munos, Andrew Moore. Machine Learning, 2002, 49(2-3): 291-323
The problem of state abstraction is of central importance in optimal control, reinforcement learning and Markov decision processes. This paper studies the case of variable resolution state abstraction for continuous time and space, deterministic dynamic control problems in which near-optimal policies are required. We begin by defining a class of variable resolution policy and value function representations based on Kuhn triangulations embedded in a kd-trie. We then consider top-down approaches to choosing which cells to split in order to generate improved policies. The core of this paper is the introduction and evaluation of a wide variety of possible splitting criteria. We begin with local approaches based on value function and policy properties that use only features of individual cells in making split choices. Later, by introducing two new non-local measures, influence and variance, we derive splitting criteria that allow one cell to efficiently take into account its impact on other cells when deciding whether to split. Influence is an efficiently-calculable measure of the extent to which changes in some state affect the value function of some other states. Variance is an efficiently-calculable measure of how risky some state is in a Markov chain: a low variance state is one in which we would be very surprised if, during any one execution, the long-term reward attained from that state differed substantially from its expected value, given by the value function. The paper proceeds by graphically demonstrating the various approaches to splitting on the familiar, non-linear, non-minimum phase, and two dimensional problem of the Car on the hill. It then evaluates the performance of a variety of splitting criteria on many benchmark problems, paying careful attention to their number-of-cells versus closeness-to-optimality tradeoff curves.
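The abstract refers to value function representations based on Kuhn triangulations. A minimal sketch of interpolation inside one unit cell follows, assuming the standard construction: sorting the coordinates of a point selects the simplex that contains it, and differences of consecutive sorted coordinates give the barycentric weights. The function name `kuhn_interpolate` and the callback `corner_value` are hypothetical; the kd-trie and the splitting criteria from the paper are not covered.

```python
def kuhn_interpolate(point, corner_value):
    """Interpolate inside the unit cube [0, 1]^d from corner values.

    point: sequence of d coordinates in [0, 1].
    corner_value: function mapping a corner (tuple of 0/1) to a number.
    Only d + 1 corners are evaluated instead of all 2**d.
    """
    d = len(point)
    # Axes ordered by decreasing coordinate identify the Kuhn simplex.
    order = sorted(range(d), key=lambda i: -point[i])
    coords = [point[i] for i in order]
    # Barycentric weights of the d + 1 simplex vertices (they sum to 1).
    weights = ([1.0 - coords[0]]
               + [coords[i] - coords[i + 1] for i in range(d - 1)]
               + [coords[-1]])
    vertex = [0] * d
    total = weights[0] * corner_value(tuple(vertex))
    for weight, axis in zip(weights[1:], order):
        vertex[axis] = 1      # walk from the origin corner toward (1, ..., 1)
        total += weight * corner_value(tuple(vertex))
    return total
```

Because the interpolant is linear on each simplex, it reproduces any affine function of the coordinates exactly, which gives a convenient sanity check for an implementation.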

18.
This paper introduces an observer-based adaptive optimal control method for unknown singularly perturbed nonlinear systems with input constraints. First, a multi-time scales dynamic neural network (MTSDNN) observer with a novel updating law derived from a properly designed Lyapunov function is proposed to estimate the system states. Then, an adaptive learning rule driven by the critic NN weight error is presented for the critic NN, which is used to approximate the optimal cost function. Finally, the optimal control action is calculated by online solving the Hamilton-Jacobi-Bellman (HJB) equation associated with the MTSDNN observer and critic NN. The stability of the overall closed-loop system consisting of the MTSDNN observer, the critic NN and the optimal control action is proved. The proposed observer-based optimal control approach has an essential advantage that the system dynamics are not needed for implementation, and only the measured input/output data is needed. Moreover, the proposed optimal control design takes the input constraints into consideration and thus can overcome the restriction of actuator saturation. Simulation results are presented to confirm the validity of the investigated approach.

19.
This paper investigates the event-triggered security consensus problem for nonlinear multi-agent systems (MASs) under denial-of-service (DoS) attacks over an undirected graph. A novel adaptive memory observer-based anti-disturbance control scheme is presented to improve the observer accuracy by adding a buffer for the system output measurements. Meanwhile, this control scheme can also provide more reasonable control signals when DoS attacks occur. To save network resources, an adaptive memory event-triggered mechanism (AMETM) is also proposed and Zeno behavior is excluded. It is worth mentioning that the AMETM’s updates do not require global information. Then, the observer and controller gains are obtained by using the linear matrix inequality (LMI) technique. Finally, simulation examples show the effectiveness of the proposed control scheme.

20.
For infinite-horizon optimal control problems of discrete time-varying nonlinear systems, this paper develops a new iterative adaptive dynamic programming algorithm, the discrete-time time-varying policy iteration (DTTV) algorithm. The iterative control law is designed to update the iterative value function, which approximates the optimal performance index function. The admissibility of the iterative control law is analyzed. The results show that the iterative value function converges non-increasingly to the optimal solution of the Bellman equation. To implement the algorithm, neural networks are employed and a new implementation structure is established, which avoids solving the generalized Bellman equation in each iteration. Finally, the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm, where the mass and pendulum bar length are permitted to be time-varying parameters. The effectiveness of the developed method is illustrated by numerical results and comparisons.
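The non-increasing convergence of policy iteration described above can be seen on a much simpler case. The sketch below is an assumption-laden illustration, not the DTTV algorithm: it applies policy iteration to a scalar, time-invariant linear system x[t+1] = a*x[t] + b*u[t] with stage cost q*x^2 + r*u^2 and quadratic value p*x^2, with no neural networks. The function name `policy_iteration_scalar` and the numeric values are hypothetical.

```python
def policy_iteration_scalar(a, b, q, r, k0, iterations=20):
    """Policy iteration for the scalar discrete-time LQ problem."""
    if abs(a - b * k0) >= 1.0:
        raise ValueError("k0 must be admissible: |a - b*k0| < 1")
    k = k0
    values = []
    for _ in range(iterations):
        closed_loop = a - b * k
        # Policy evaluation: p solves p = q + r*k**2 + closed_loop**2 * p.
        p = (q + r * k * k) / (1.0 - closed_loop * closed_loop)
        # Policy improvement: minimize r*u**2 + p*(a*x + b*u)**2 over u = -k*x.
        k = a * b * p / (r + b * b * p)
        values.append(p)
    return p, k, values
```

Called as `policy_iteration_scalar(1.2, 1.0, 1.0, 1.0, k0=1.2)`, the recorded `values` should form a non-increasing sequence that settles at the solution of the scalar Bellman (Riccati) equation p = q + a^2*p - (a*b*p)^2 / (r + b^2*p).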
