首页 | 本学科首页   官方微博 | 高级检索  
     

基于Dueling Double DQN的交通信号控制方法
引用本文:叶宝林,陈栋,刘春元,陈滨,吴维敏. 基于Dueling Double DQN的交通信号控制方法[J]. 计算机测量与控制, 2024, 32(7): 154-161
作者姓名:叶宝林  陈栋  刘春元  陈滨  吴维敏
作者单位:浙江理工大学 信息科学与工程学院,,,,
基金项目:国家自然科学基金青年科学基金项目(No. 61603154),浙江省自然科学基金探索项目(No. LTGS23F030002),嘉兴市应用性基础研究项目(2023AY11034),工业控制技术国家重点实验室开放课题(No. ICT2022B52)
摘    要:为了提高交叉口通行效率缓解交通拥堵,深入挖掘交通状态信息中所包含的深层次隐含特征信息,提出了一种基于Dueling Double DQN (D3QN) 的单交叉口交通信号控制方法。构建了一个基于深度强化学习Double DQN(DDQN)的交通信号控制模型,对动作-价值函数的估计值和目标值迭代运算过程进行了优化,克服基于深度强化学习DQN的交通信号控制模型存在收敛速度慢的问题。设计了一个新的Dueling Network解耦交通状态和相位动作的价值,增强Double DQN (DDQN) 提取深层次特征信息的能力。基于微观仿真平台SUMO搭建了一个单交叉口模拟仿真框架和环境,开展仿真测试。仿真测试结果表明,与传统交通信号控制方法和基于深度强化学习DQN的交通信号控制方法相比,所提方法能够有效减少车辆平均等待时间、车辆平均排队长度和车辆平均停车次数,明显提升交叉口通行效率。

关 键 词:交通信号控制  深度强化学习  Dueling Double DQN(D3QN)  Dueling Network  
收稿时间:2023-11-26
修稿时间:2024-01-09

Traffic Signal Control Method based on Dueling Double DQN
Abstract:In order to improve the efficiency of intersection traffic and alleviate traffic congestion, a single intersection traffic signal control method based on Dueling Double DQN (D3QN) is proposed by deeply exploring the deep hidden feature information contained in traffic status information. A traffic signal control model based on deep reinforcement learning Double DQN (DDQN) was constructed, and the iterative operation process of the estimated value and target value of the action value function was optimized to overcome the problem of slow convergence speed in traffic signal control models based on deep reinforcement learning DQN. A new Dueling Network has been designed to decouple the value of traffic states and phase actions, enhancing the ability of Double DQN (DDQN) to extract deep level feature information. A single intersection simulation framework and environment were built based on the micro simulation platform simulation of urban mobility(SUMO)for simulation testing. The simulation test results show that compared with traditional traffic signal control methods and traffic signal control methods based on deep reinforcement learning DQN, the proposed method can effectively reduce the average waiting time, average queue length, and average number of stops of vehicles, significantly improving the efficiency of intersection traffic.
Keywords:Traffic signal control   Deep reinforcement learning   Dueling double DQN(D3QN)   Dueling network.
点击此处可从《计算机测量与控制》浏览原始摘要信息
点击此处可从《计算机测量与控制》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号