基于Q学习的任务调度问题的改进研究 Improvement of task scheduling based on Q-learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Q学习的任务调度问题的改进研究

引用本文：	刘晓平,杜琳,石慧. 基于Q学习的任务调度问题的改进研究[J]. 工程图学学报, 2012, 33(3): 11-16

作者姓名：	刘晓平杜琳石慧

作者单位：	合肥工业大学计算机与信息学院,安徽合肥,230009

基金项目：	国家自然科学基金资助项目，合肥工业大学自主创新资助项目

摘要：	论文针对协同工作中的任务调度问题,建立了相应的马尔可夫决策过程模型,在此基础上提出了一种改进的基于模拟退火的Q学习算法。该算法通过引入模拟退火,并结合贪婪策略,以及在状态空间上的筛选判断,显著地提高了收敛速度,缩短了执行时间。最后与其它文献中相关算法的对比分析,验证了本改进算法的高效性。
关键词：	任务调度 Q学习强化学习模拟退火
Improvement of task scheduling based on Q-learning

Liu Xiaoping , Du Lin , Shi Hui. Improvement of task scheduling based on Q-learning[J]. Journal of Engineering Graphics, 2012, 33(3): 11-16

Authors:	Liu Xiaoping Du Lin Shi Hui

Affiliation:	(School of Computer and Information,Hefei University of Technology,Heifei Anhui 230009,China)

Abstract:	In this paper,a Markov Decision Process model is built to describe the problem of task scheduling in cooperative work,and a improved Q-learning algorithm based on Metropolis rule is present to solve the problem.In the algorithm,Metropolis rule combined with Greedy Strategy is introduced and a selection in state space is adopted,which accelerate the convergence,and shorten the running time.Finally,the algorithm is compared to some related algorithms of other papers,and the algorithm performance is analyzed as well,which indicates the efficiency of the improved Q-learning algorithm.

Keywords:	task scheduling Q-learning reinforcement learning simulated annealing
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏