首页 | 本学科首页   官方微博 | 高级检索  
     

强化学习在车辆路径问题中的研究综述
引用本文:牛鹏飞,王晓峰,芦磊,张九龙.强化学习在车辆路径问题中的研究综述[J].计算机工程与应用,2022,58(1):41-55.
作者姓名:牛鹏飞  王晓峰  芦磊  张九龙
作者单位:1.北方民族大学 计算机科学与工程学院,银川 750021 2.北方民族大学 图像图形智能处理国家民委重点实验室,银川 750021
基金项目:国家自然科学基金(62062001,61762019,61862051,61962002);宁夏自然科学基金(2020AAC03214,2020AAC03219,2019AAC03120,2019AAC03119);北方民族大学重大专项(ZDZX201901)。
摘    要:车辆路径问题是物流运输优化中的核心问题,目的是在满足顾客需求下得到一条最低成本的车辆路径规划。但随着物流运输规模的不断增大,车辆路径问题求解难度增加,并且对实时性要求也不断提高,已有的常规算法不再适应实际要求。近年来,基于强化学习算法开始成为求解车辆路径问题的重要方法,在简要回顾常规方法求解车辆路径问题的基础上,重点总结基于强化学习求解车辆路径问题的算法,并将算法按照基于动态规划、基于价值、基于策略的方式进行了分类;最后对该问题未来的研究进行了展望。

关 键 词:车辆路径问题  马尔科夫决策过程  强化学习  深度强化学习  

Survey on Vehicle Reinforcement Learning in Routing Problem
NIU Pengfei,WANG Xiaofeng,LU Lei,ZHANG Jiulong.Survey on Vehicle Reinforcement Learning in Routing Problem[J].Computer Engineering and Applications,2022,58(1):41-55.
Authors:NIU Pengfei  WANG Xiaofeng  LU Lei  ZHANG Jiulong
Affiliation:1.College of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China 2.The Key Laboratory of Images & Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan 750021, China
Abstract:Vehicle routing problem is the key technologies in the field of logistics research. Its purpose is to get a lowest cost vehicle routing plan while meeting the customer’s needs. However, with the increasing of problem size in logistics transportation, the real-time requirement of solving vehicle routing problem is increasing, and the traditional algorithm cannot realize the requirements of the industry gradually. For decades, a number of new methods use reinforcement learning and deep reinforcement learning to solve vehicle routing problem. Base on simple analysis of conventional methods for solving vehicle routing problem, this review summaries the current algorithms for solving vehicle routing problem based on reinforcement learning. Reinforcement learning algorithms are divided into dynamic programming, value-based and policy-based. This paper summarizes the theoretical foundation and studying status. Finally, the future development direction of vehicle routing problem based on reinforcement learning and deep reinforcement learning is prospected.
Keywords:vehicle routing problem  Markov decision process  reinforcement learning  deep reinforcement learning
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号