TR-light：基于多信号灯强化学习的交通组织方案优化算法 TR-light:traffic organization plan optimization algorithm based on multiple traffic signal lights reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

TR-light：基于多信号灯强化学习的交通组织方案优化算法

引用本文：	吴昊昇,郑皎凌.TR-light：基于多信号灯强化学习的交通组织方案优化算法[J].计算机应用研究,2022,39(2):504-509+514.

作者姓名：	吴昊昇郑皎凌

作者单位：	成都信息工程大学软件工程学院

基金项目：	四川省科技厅应用基础研究项目(2020YJ0430);基于群体智能的区域交通流量精准控制技术应用研究。

摘要：	针对多变环境条件下的交通堵塞问题，将强化学习、神经网络、多智能体和交通仿真技术结合起来，提出了用于优化多路口条件下交通状况的trajectory reward light(TR-light)模型。该方法具有几个显著特点：基于红绿灯拟定交通组织方案；将多智能体强化学习用于红绿灯控制；通过红绿灯的协同达到区域级的交通组织优化；在智能体每次行为执行结束后实施轨迹重构，在OD对不改变的情况下改变车辆行驶路径，根据方案和重构轨迹来计算智能体的最终回报。通过SUMO进行交通仿真实验和交通指标对比，验证了该模型在多交叉口中能够提高路网畅通率，改善交通状态。实验表明该模型可行，可有效缓解交通拥堵。
关键词：	多智能体强化学习 SUMO 红绿灯
收稿时间：	2021/6/4 0:00:00
修稿时间：	2022/1/17 0:00:00
TR-light:traffic organization plan optimization algorithm based on multiple traffic signal lights reinforcement learning

Wu Haosheng,Zheng Jiaoling,Wang Maofan.TR-light:traffic organization plan optimization algorithm based on multiple traffic signal lights reinforcement learning[J].Application Research of Computers,2022,39(2):504-509+514.

Authors:	Wu Haosheng Zheng Jiaoling Wang Maofan

Affiliation:	(School of Software Engineering,Chengdu University of Information Technology,Chengdu 610225,China)

Abstract:	Focusing on the problem with traffic congestion under changing environmental conditions, the paper proposed a trajectory reward light(TR-Light) model by combining reinforcement learning, neural network, multi-agent and traffic simulation technology to optimize the traffic at multi-intersections. This method had considerable merits in the following aspects. The traffic organization plan was formulated based on traffic lights; multi-agent reinforcement learning was used on traffic light control; regional traffic organization was optimized through the coordination of traffic lights; the agent implemented trajectory reconstruction after the execution of each behavior so as to change the vehicle travel path without changing the OD pair, and to calculate the final reward of the agent according to the plan and reconstructed trajectory. Finally, it conducted a traffic simulation experiment through SUMO. The comparison of traffic indicators verifies that the proposed model improves the smoothness of the road network and the traffic state at the multi-intersections. Experiments show that the model is feasible and effectively mitigates the traffic congestion.

Keywords:	multi-agent reinforcement learning SUMO traffic lights
本文献已被维普等数据库收录！
	点击此处可从《计算机应用研究》浏览原始摘要信息
	点击此处可从《计算机应用研究》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏