首页 | 本学科首页   官方微博 | 高级检索  
     

基于Q-学习算法的交通控制与诱导协同模式的在线选择
引用本文:杨庆芳,杨朝.基于Q-学习算法的交通控制与诱导协同模式的在线选择[J].吉林大学学报(工学版),2010,40(5).
作者姓名:杨庆芳  杨朝
作者单位:1. 吉林大学,汽车动态模拟国家重点实验室,长春,130022;吉林大学,交通学院,长春,130022
2. 吉林大学,交通学院,长春,130022
基金项目:"863"国家高技术研究发展计划项目 
摘    要:采用Q-学习算法实现了交通控制与诱导协同模式的在线选择。首先,采用Q-学习算法训练多智能体,根据多智能体内部的推理得到不同交通状态下的最优协同模式,最终实现交通控制与交通诱导协同模式的在线选择与转换。仿真结果表明,本文提出的基于Q-学习算法的协同模式选择方法在一般交通拥挤状态下具有较好的协同控制效果,对比离线式模式选择方法更能适应交通状态的不断变化,从而达到有效避免严重交通拥堵、改善路网性能的目的。

关 键 词:交通运输工程  交通控制与诱导协同  模式选择  Q-学习算法  回报函数

On-line selection method of the traffic control and route guidance collaboration mode based on Q-learning algorithm
YANG Qing-fang,YANG Chao.On-line selection method of the traffic control and route guidance collaboration mode based on Q-learning algorithm[J].Journal of Jilin University:Eng and Technol Ed,2010,40(5).
Authors:YANG Qing-fang  YANG Chao
Abstract:The on-line traffic control and route guidance collaboration mode selection was realized by the Q-learning algorithm. Using the multi-intelligence agents trained with the Q-learning algorithm, the optimal collaboration mode was obtained under different traffic conditions according to the inner inference of the multi-intelligence agent. So,the on-line selection and switching of the traffic control and route guidance collaboration mode was accomplished. The simulation results show that the proposed collaboration mode selection method based on the Q-learning is characterized by better collaboration control effect under the ordinary traffic congestion condition and more adaptive to constantly changing traffic condition than the traditional off-line mode selection method. The proposed method is helpful to avoiding the heavy traffic congestion and improving the traffic network performance.
Keywords:engineering of communications and transportation  collaboration of traffic control and route guidance  mode selection  Q-learning algorithm  reward function
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号