Average-Cost Optimal Policy Based on Performance Potentials for Markov Control Processes

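Cite this article:	Zhou Yaping, Xi Hongsheng, Yin Baoqun, Sun Demin. Average-Cost Optimal Policy Based on Performance Potentials for Markov Control Processes [J]. 自动化学报 (Acta Automatica Sinica), 2002, 28(6): 904-910.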

Authors:	Zhou Yaping, Xi Hongsheng, Yin Baoqun, Sun Demin

Affiliation:	1. Department of Management Science, University of Science and Technology of China, Hefei;

Funding:	Supported by the National Natural Science Foundation of China (69974037) and the National High Performance Computing Foundation (00212)

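Abstract:	This paper studies the average-cost optimal control decision problem for a class of discrete-time Markov control processes. Using the basic properties of Markov performance potentials, and under quite general assumptions, the optimality equation for the infinite-horizon average-cost model on a compact action set is derived directly, together with an existence theorem for its solution. An iterative algorithm for computing the optimal stationary control policy is proposed, and its convergence is discussed. Finally, an example is analyzed to illustrate the application of the algorithm.
Keywords:	Markov control process; performance potential; average-cost model; optimal stationary policy
Received:	2000-12-7
Revised:	2000-12-7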
OPTIMALITY STRATEGY OF AVERAGE COST BASED PERFORMANCE POTENTIALS FOR MARKOV CONTROL PROCESS

ZHOU Ya-Ping, XI Hong-Sheng, YIN Bao-Qun, SUN De-Min. OPTIMALITY STRATEGY OF AVERAGE COST BASED PERFORMANCE POTENTIALS FOR MARKOV CONTROL PROCESS [J]. Acta Automatica Sinica, 2002, 28(6): 904-910.

Authors:	ZHOU Ya-Ping, XI Hong-Sheng, YIN Bao-Qun, SUN De-Min

Affiliation:	1. Department of Management Science, University of Science and Technology of China, Hefei; Department of Automation, University of Science and Technology of China, Hefei

Abstract:	This paper deals with the average-cost optimization problem for a class of discrete-time Markov control processes. Using the basic properties of Markov performance potentials, and under quite general assumptions, the optimality equation for the infinite-horizon average-cost model on a compact action set is established directly, and an existence theorem for its solution is proved. An iterative algorithm for computing the optimal stationary control strategy is proposed, and its convergence is discussed. Finally, a numerical example is analyzed to illustrate the application of the proposed algorithm.

Keywords:	Markov control process; performance potentials; average cost model; optimal stationary strategy
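
The abstract mentions an iterative algorithm built on performance potentials but does not spell it out. As a rough illustration only, the sketch below implements a generic potential-based policy iteration for a finite-state, finite-action model in which every policy yields an ergodic chain. This is a simplifying assumption: the paper itself treats compact action sets, and its algorithm may differ. The function names (`stationary_distribution`, `potentials`, `policy_iteration`) and the two-state example data are hypothetical and do not come from the paper. The potential g is taken as the solution of (I - P + e*pi) g = f, which satisfies the Poisson equation (I - P) g + eta*e = f with eta = pi f.

```python
import numpy as np


def stationary_distribution(P):
    """Stationary row vector pi of an ergodic transition matrix P (pi P = pi, sum(pi) = 1)."""
    n = P.shape[0]
    # Stack the balance equations with the normalization constraint and solve by least squares.
    A = np.vstack([(np.eye(n) - P).T, np.ones((1, n))])
    b = np.zeros(n + 1)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi


def potentials(P, f):
    """Performance potentials g and average cost eta for a fixed policy.

    Solves (I - P + e pi) g = f, so that (I - P) g + eta * e = f with eta = pi f.
    """
    n = P.shape[0]
    pi = stationary_distribution(P)
    eta = float(pi @ f)
    g = np.linalg.solve(np.eye(n) - P + np.outer(np.ones(n), pi), f)
    return g, eta


def policy_iteration(P, f, max_iter=100, tol=1e-10):
    """Potential-based policy iteration (illustrative sketch).

    P has shape (actions, states, states); f has shape (states, actions).
    Returns a stationary policy, its average cost, and its potentials.
    """
    n_actions, n_states, _ = P.shape
    states = np.arange(n_states)
    policy = np.zeros(n_states, dtype=int)
    for _ in range(max_iter):
        P_pi = P[policy, states, :]   # transition matrix under the current policy
        f_pi = f[states, policy]      # one-step cost under the current policy
        g, eta = potentials(P_pi, f_pi)
        # q[s, a] = f(s, a) + sum_j P(j | s, a) g(j)
        q = f + np.einsum("asj,j->sa", P, g)
        best = q.argmin(axis=1)
        # Switch actions only where the improvement is strict, to avoid cycling on ties.
        improve = q[states, best] < q[states, policy] - tol
        if not improve.any():
            break
        policy = np.where(improve, best, policy)
    return policy, eta, g


# Hypothetical two-state, two-action example (all entries positive, so every policy is ergodic).
P = np.array([
    [[0.9, 0.1], [0.5, 0.5]],   # action 0
    [[0.2, 0.8], [0.1, 0.9]],   # action 1
])
f = np.array([[1.0, 3.0], [4.0, 0.5]])   # f[state, action]

policy, eta, g = policy_iteration(P, f)
print("policy:", policy, "average cost:", eta)
```

Each pass evaluates the current policy through its potentials and then improves it state by state, so no discounting or separate value iteration is needed. On a small model like this one, the result can be checked by enumerating all deterministic policies.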
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.