首页 | 本学科首页   官方微博 | 高级检索  
     


Dynamic intermittent Q‐learning–based model‐free suboptimal co‐design of ‐stabilization
Authors:Yongliang Yang  Kyriakos G Vamvoudakis  Henrique Ferraz  Hamidreza Modares
Abstract:This paper proposes an intermittent model‐free learning algorithm for linear time‐invariant systems, where the control policy and transmission decisions are co‐designed simultaneously while also being subjected to worst‐case disturbances. The control policy is designed by introducing an internal dynamical system to further reduce the transmission rate and provide bandwidth flexibility in cyber‐physical systems. Moreover, a Q‐learning algorithm with two actors and a single critic structure is developed to learn the optimal parameters of a Q‐function. It is shown by using an impulsive system approach that the closed‐loop system has an asymptotically stable equilibrium and that no Zeno behavior occurs. Furthermore, a qualitative performance analysis of the model‐free dynamic intermittent framework is given and shows the degree of suboptimality concerning the optimal continuous updated controller. Finally, a numerical simulation of an unknown system is carried out to highlight the efficacy of the proposed framework.
Keywords:actor‐critic  dynamic event‐triggering condition  intermittent Q‐learning  suboptimal performance  Zeno‐free
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号