基于增强学习的网格化出租车调度方法 Grid-based taxi dispatching method based on reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于增强学习的网格化出租车调度方法

引用本文：	何胜学.基于增强学习的网格化出租车调度方法[J].计算机应用研究,2019,36(3).

作者姓名：	何胜学

作者单位：	上海理工大学管理学院,上海,200093

基金项目：	上海市自然科学基金项目 (18ZR1426200)；上海理工大学人文社科攀登重点项目(SK17PA02);上海市一流学科建设项目(S1201YLXK)

摘要：	高度信息化的网格化城市管理可以为出租车运营优化提供新的实时动态乘客需求信息和车辆位置信息。以此为契机，针对城市出租车空驶率高和司乘匹配率低的问题，提出了一种网格化的出租车实时动态调度的增强学习控制方法。通过为出租车提供空驶巡游的动态最佳路线，新的控制方法旨在提高出租车的服务效率，并降低乘客的等待时间。首先，以城市单元网格为基础，明确出租车调度的关键问题；其次，以空驶路线的动态调整为控制手段，建立调度的增强学习模型；最后，给出求解模型的Q学习算法，并通过算例验证新调度方法的有效性。研究表明新方法可以有效提高司乘匹配率、增加总的出租车运营收入、减少乘客平均等车时间和减少总的出租车空驶时间。
关键词：	城市交通出租车调度增强学习网格化管理自适应式控制
收稿时间：	2017/11/7 0:00:00
修稿时间：	2019/1/28 0:00:00
Grid-based taxi dispatching method based on reinforcement learning

He Sheng-Xue.Grid-based taxi dispatching method based on reinforcement learning[J].Application Research of Computers,2019,36(3).

Authors:	He Sheng-Xue

Affiliation:	Business School,University of Shanghai for Science and Technology

Abstract:	Highly-informed grid-based city management can supply the real time passenger information and the position information of taxis for taxi operation optimization. On this account, we proposed a grid-based taxi dispatching dynamic control method based on reinforcement learning to solve the problem of the high vacant taxis rate and the low matching rate between taxis and passengers. By providing the optimal cruising routes for the vacant taxis, the new control method aims to improve the service level of taxis and to lower the waiting time of passengers. Firstly, based on the grids of city, we clarified the key problem of taxi dispatching. Secondly, by using the adjustment of vacant taxi route as the control action, we formulated the reinforcement learning model of taxi dispatching. At last, we proposed the corresponding Q learning algorithm to solve the new model. Numerical example demonstrated the effectiveness of the new dispatching method. The results show that the new method can not only increase the match rate between taxis and passengers and the total income of operation of taxi service, but also reduce the average waiting time of passengers and the total travel time of vacant taxis.

Keywords:	urban transportation taxi dispatching reinforcement learning grid management adaptive control
本文献已被万方数据等数据库收录！
	点击此处可从《计算机应用研究》浏览原始摘要信息
	点击此处可从《计算机应用研究》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏