首页 | 本学科首页   官方微博 | 高级检索  
     


Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes
Authors:Jong Min Lee  [Author Vitae] [Author Vitae]
Affiliation:311 Ferst Dr. NW, School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0100, USA
Abstract:We propose two approximate dynamic programming (ADP)-based strategies for control of nonlinear processes using input-output data. In the first strategy, which we term ‘J-learning,’ one builds an empirical nonlinear model using closed-loop test data and performs dynamic programming with it to derive an improved control policy. In the second strategy, called ‘Q-learning,’ one tries to learn an improved control policy in a model-less manner. Compared to the conventional model predictive control approach, the new approach offers some practical advantages in using nonlinear empirical models for process control. Besides the potential reduction in the on-line computational burden, it offers a convenient way to control the degree of model extrapolation in the calculation of optimal control moves. One major difficulty associated with using an empirical model within the multi-step predictive control setting is that the model can be excessively extrapolated into regions of the state space where identification data were scarce or nonexistent, leading to performances far worse than predicted by the model. Within the proposed ADP-based strategies, this problem is handled by imposing a penalty term designed on the basis of local data distribution. A CSTR example is provided to illustrate the proposed approaches.
Keywords:Nonlinear model identification  Nonlinear model predictive control  Approximate dynamic programming  NARX model  Reinforcement learning  _method=retrieve&  _eid=1-s2  0-S0005109805000786&  _mathId=si17  gif&  _pii=S0005109805000786&  _issn=00051098&  _acct=C000053510&  _version=1&  _userid=1524097&  md5=1f09dda60260e74c64fc0041050472ab')" style="cursor:pointer  Q-learning" target="_blank">" alt="Click to view the MathML source" title="Click to view the MathML source">Q-learning
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号