首页 | 本学科首页   官方微博 | 高级检索  
     

基于增强学习的网格化出租车调度方法
引用本文:何胜学.基于增强学习的网格化出租车调度方法[J].计算机应用研究,2019,36(3).
作者姓名:何胜学
作者单位:上海理工大学管理学院,上海,200093
基金项目:上海市自然科学基金项目 (18ZR1426200);上海理工大学人文社科攀登重点项目(SK17PA02);上海市一流学科建设项目(S1201YLXK)
摘    要:高度信息化的网格化城市管理可以为出租车运营优化提供新的实时动态乘客需求信息和车辆位置信息。以此为契机,针对城市出租车空驶率高和司乘匹配率低的问题,提出了一种网格化的出租车实时动态调度的增强学习控制方法。通过为出租车提供空驶巡游的动态最佳路线,新的控制方法旨在提高出租车的服务效率,并降低乘客的等待时间。首先,以城市单元网格为基础,明确出租车调度的关键问题;其次,以空驶路线的动态调整为控制手段,建立调度的增强学习模型;最后,给出求解模型的Q学习算法,并通过算例验证新调度方法的有效性。研究表明新方法可以有效提高司乘匹配率、增加总的出租车运营收入、减少乘客平均等车时间和减少总的出租车空驶时间。

关 键 词:城市交通  出租车调度  增强学习  网格化管理  自适应式控制
收稿时间:2017/11/7 0:00:00
修稿时间:2019/1/28 0:00:00

Grid-based taxi dispatching method based on reinforcement learning
He Sheng-Xue.Grid-based taxi dispatching method based on reinforcement learning[J].Application Research of Computers,2019,36(3).
Authors:He Sheng-Xue
Affiliation:Business School,University of Shanghai for Science and Technology
Abstract:Highly-informed grid-based city management can supply the real time passenger information and the position information of taxis for taxi operation optimization. On this account, we proposed a grid-based taxi dispatching dynamic control method based on reinforcement learning to solve the problem of the high vacant taxis rate and the low matching rate between taxis and passengers. By providing the optimal cruising routes for the vacant taxis, the new control method aims to improve the service level of taxis and to lower the waiting time of passengers. Firstly, based on the grids of city, we clarified the key problem of taxi dispatching. Secondly, by using the adjustment of vacant taxi route as the control action, we formulated the reinforcement learning model of taxi dispatching. At last, we proposed the corresponding Q learning algorithm to solve the new model. Numerical example demonstrated the effectiveness of the new dispatching method. The results show that the new method can not only increase the match rate between taxis and passengers and the total income of operation of taxi service, but also reduce the average waiting time of passengers and the total travel time of vacant taxis.
Keywords:urban transportation  taxi dispatching  reinforcement learning  grid management  adaptive control
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号