强化学习在车辆路径问题中的研究综述 Survey on Vehicle Reinforcement Learning in Routing Problem期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

强化学习在车辆路径问题中的研究综述

引用本文：	牛鹏飞,王晓峰,芦磊,张九龙.强化学习在车辆路径问题中的研究综述[J].计算机工程与应用,2022,58(1):41-55.

作者姓名：	牛鹏飞王晓峰芦磊张九龙

作者单位：	1.北方民族大学计算机科学与工程学院，银川 750021 2.北方民族大学图像图形智能处理国家民委重点实验室，银川 750021

基金项目：	国家自然科学基金(62062001,61762019,61862051,61962002);宁夏自然科学基金(2020AAC03214,2020AAC03219,2019AAC03120,2019AAC03119);北方民族大学重大专项(ZDZX201901)。

摘要：	车辆路径问题是物流运输优化中的核心问题，目的是在满足顾客需求下得到一条最低成本的车辆路径规划。但随着物流运输规模的不断增大，车辆路径问题求解难度增加，并且对实时性要求也不断提高，已有的常规算法不再适应实际要求。近年来，基于强化学习算法开始成为求解车辆路径问题的重要方法，在简要回顾常规方法求解车辆路径问题的基础上，重点总结基于强化学习求解车辆路径问题的算法，并将算法按照基于动态规划、基于价值、基于策略的方式进行了分类；最后对该问题未来的研究进行了展望。
关键词：	车辆路径问题马尔科夫决策过程强化学习深度强化学习
Survey on Vehicle Reinforcement Learning in Routing Problem

NIU Pengfei,WANG Xiaofeng,LU Lei,ZHANG Jiulong.Survey on Vehicle Reinforcement Learning in Routing Problem[J].Computer Engineering and Applications,2022,58(1):41-55.

Authors:	NIU Pengfei WANG Xiaofeng LU Lei ZHANG Jiulong

Affiliation:	1.College of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China 2.The Key Laboratory of Images & Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan 750021, China

Abstract:	Vehicle routing problem is the key technologies in the field of logistics research. Its purpose is to get a lowest cost vehicle routing plan while meeting the customer’s needs. However, with the increasing of problem size in logistics transportation, the real-time requirement of solving vehicle routing problem is increasing, and the traditional algorithm cannot realize the requirements of the industry gradually. For decades, a number of new methods use reinforcement learning and deep reinforcement learning to solve vehicle routing problem. Base on simple analysis of conventional methods for solving vehicle routing problem, this review summaries the current algorithms for solving vehicle routing problem based on reinforcement learning. Reinforcement learning algorithms are divided into dynamic programming, value-based and policy-based. This paper summarizes the theoretical foundation and studying status. Finally, the future development direction of vehicle routing problem based on reinforcement learning and deep reinforcement learning is prospected.

Keywords:	vehicle routing problem Markov decision process reinforcement learning deep reinforcement learning
本文献已被维普万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏