首页 | 本学科首页   官方微博 | 高级检索  
     

A Reinforcement Learning-based Approach to Dynamic Job-shop Scheduling
引用本文:WEI Ying-Zi~(1,2) ZHAO Ming-Yang~1 ~1(Shenyang Institute of Automation,Chinese Academy of Sciences,Shenyang 110016)~2(Shenyang Ligong University,Shenyang 110168). A Reinforcement Learning-based Approach to Dynamic Job-shop Scheduling[J]. 自动化学报, 2005, 0(5)
作者姓名:WEI Ying-Zi~(1  2) ZHAO Ming-Yang~1 ~1(Shenyang Institute of Automation  Chinese Academy of Sciences  Shenyang 110016)~2(Shenyang Ligong University  Shenyang 110168)
作者单位:WEI Ying-Zi~(1,2) ZHAO Ming-Yang~1 ~1(Shenyang Institute of Automation,Chinese Academy of Sciences,Shenyang 110016)~2(Shenyang Ligong University,Shenyang 110168)
基金项目:Supported by 973 program of P.R.China (2002CB312200)
摘    要:Production scheduling is critical to manufacturing system.Dispatching rules are usually applied dynamically to schedule (?)he job in a dynamic job-shop.Existing scheduling approaches sel- dom address machine selection in the scheduling process.Composite rules,considering both machine selection and job selection,are proposed in this paper.The dynamic system is trained to enhance its learning and adaptive capability by a reinforcement learning(RL)algorithm.We define the concep- tion of pressure to describe the system feature.Designing a reward function should be guided by the scheduling goal to accurately record the learning progress.Competitive results with the RL-based approach show that it can be used as real-time scheduling technology.


A Reinforcement Learning-based Approach to Dynamic Job-shop Scheduling
WEI Ying-Zi. A Reinforcement Learning-based Approach to Dynamic Job-shop Scheduling[J]. Acta Automatica Sinica, 2005, 0(5)
Authors:WEI Ying-Zi
Affiliation:WEI Ying-Zi~
Abstract:Production scheduling is critical to manufacturing system.Dispatching rules are usually applied dynamically to schedule (?)he job in a dynamic job-shop.Existing scheduling approaches sel- dom address machine selection in the scheduling process.Composite rules,considering both machine selection and job selection,are proposed in this paper.The dynamic system is trained to enhance its learning and adaptive capability by a reinforcement learning(RL)algorithm.We define the concep- tion of pressure to describe the system feature.Designing a reward function should be guided by the scheduling goal to accurately record the learning progress.Competitive results with the RL-based approach show that it can be used as real-time scheduling technology.
Keywords:Reinforcement learning  composite rules  mean tardiness  dynamic job-shop scheduling  
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号