首页 | 本学科首页   官方微博 | 高级检索  
     

基于行人安全的交通信号灯智能控制算法研究
引用本文:张乾隆,胡智群,肖海林.基于行人安全的交通信号灯智能控制算法研究[J].计算机测量与控制,2022,30(4):114-120.
作者姓名:张乾隆  胡智群  肖海林
作者单位:湖北大学计算机与信息工程学院,武汉 430062
基金项目:国家自然科学基金项目(面上项目,重点项目,重大项目)
摘    要:提出了一种基于深度确定性策略梯度(Deep Deterministic Policy Gradient , DDPG)的行人安全智能交通信号控制算法。通过对交叉口数据的实时观测,综合考虑行人安全与车辆通行效率,智能地调控交通信号周期时长,相位顺序以及相位持续时间,实现交叉路口安全高效的智能控制。同时,采用优先经验回放提高采样效率,加速了算法收敛。由于行人安全与车辆通行效率存在相互矛盾,研究中通过精确地设计强化学习的奖励函数,折中考虑行人违规引起的与车辆的冲突量和车辆通行的速度,引导交通信号灯学习路口行人的行为,学习最佳的配时方案。仿真结果表明在动态环境下,该算法在行人与车辆冲突量,车辆的平均速度、等待时间和队列长度均优于现有的固定配时方案和其他的智能配时方案。

关 键 词:交通信号灯  动态配时  强化学习  行人安全  车辆效率  优先经验回放
收稿时间:2021/11/8 0:00:00
修稿时间:2021/12/1 0:00:00

Research on intelligent control algorithm of traffic light based on pedestrian safety
ZHANG Qianlong,HU Zhiqun,XIAO Hailin.Research on intelligent control algorithm of traffic light based on pedestrian safety[J].Computer Measurement & Control,2022,30(4):114-120.
Authors:ZHANG Qianlong  HU Zhiqun  XIAO Hailin
Abstract:An intelligent traffic signal control algorithm based on Deep Deterministic Policy Gradient (DDPG) with Pedestrian Safe is proposed. Through real-time observation of intersection data, the pedestrian safety and vehicle traffic efficiency are comprehensively considered, and the cycle duration, phase sequence and phase duration of traffic signals are intelligently controlled, safe and efficient intelligent control of intersections is realized. Meanwhile, priority empirical replay is adopted to improve sampling efficiency and accelerate algorithm convergence.Due to the contradiction between pedestrian safety and vehicle traffic efficiency, by accurately designing the reward function of reinforcement learning,the study considers the amount of pedestrian-vehicle conflicts caused by pedestrian violations and the speed of vehicles, guides traffic light to learn pedestrian behaviors at intersections, and learns the best timing scheme. The simulation results show that in the dynamic environment, the algorithm in terms of the number of collisions between pedestrians and vehicles, the average speed of vehicles, waiting time and queue length is better than the existing fixed timing schemes and other intelligent timing schemes.
Keywords:traffic signal light  dynamic timing  reinforcement learning  pedestrian safety  vehicle efficiency  prioritized experience replay
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机测量与控制》浏览原始摘要信息
点击此处可从《计算机测量与控制》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号