nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2025, 01, v.55 28-35
基于改进D3QN的单点交叉口信号控制研究
基金项目(Foundation): 江苏省扬州市产业前瞻与共性关键技术——产业前瞻研发重点资助项目(YZ2021016); 江苏省扬州市2021年市级计划——市校合作专项资助项目(YZ2021159)~~
邮箱(Email):
DOI:
发布时间: 2024-12-19
出版时间: 2024-12-19
网络发布时间: 2024-12-19
移动端阅读
摘要:

近年交通拥堵已成为制约城市经济发展的重要问题,利用深度强化学习(Deep Reinforcement Learning, DRL)对交通信号灯进行自适应控制是缓解交通拥堵的研究热点。针对决斗双重深度Q网络(Dueling Double Deep Q-Network, D3QN)算法在交通信号控制中存在的样本利用率低、学习速度慢,以及路网状态信息复杂且灵活性差等问题,基于非均匀划分道路的离散交通状态编码(Discrete Traffic State Encode, DTSE)方法,提出一种D3PQN2交通信号控制算法。该算法在D3QN算法基础上引入噪声网络、优先级经验回放技术来提高样本的利用效率以及学习速度,通过噪声扰动代替传统的ε-贪婪策略,使得算法能够更快更好地收敛到全局最优解。以扬州市文昌路和扬子江路交叉口为例,在Weibull分布生成的车流下进行实验,结果表明,改进后的算法相较于对抗深度Q网络(Dueling Deep Q-Network, Dueling DQN)算法和固定配时的控制方法,车辆平均排队长度分别减少了12.11%和67.44%,累计延误时间分别减少了13.89%和42.88%,具有更好的控制效果。

Abstract:

In recent years, traffic congestion has become an important issue restricting the development of urban economy. Using Deep Reinforcement Learning(DRL) to adaptively control traffic lights has become a research hotspot to alleviate traffic congestion. To solve the problems of low sample utilization, slow learning speed, complex road network state information and poor flexibility in the Dueling Double Deep Q-Network(D3QN) algorithm in traffic signal control, a traffic signal control algorithm D3PQN2 is proposed based on the Discrete Traffic State Encode(DTSE) method for non-uniformly divided roads. Based on the D3QN algorithm, the algorithm introduces noise network and priority experience replay technology to improve the utilization efficiency and learning speed of samples, and replaces the traditional ε-greedy strategy with noise perturbation, so that the algorithm can converge to the global optimal solution faster and better. Taking the intersection of Wenchang Road and Yangtze River Road in Yangzhou as an example, experiments are carried out based on the traffic flow generated by Weibull distribution. The results show that compared with the Dueling Deep Q-Network(Dueling DQN) algorithm and the fixed timing control method, the average queue length of vehicles is reduced by 12.11% and 67.44%, and the cumulative delay time is reduced by 13.89% and 42.88%, respectively, proving that the method has better control effect.

参考文献

[1] 宋国治,苏鹏博,刘畅,等.基于Q 学习的智能交通信号灯优化[J].郑州大学学报(理学版),2022,54(5):57-63.

[2] 白静静,任安虎,李珊.基于SARSA 学习的单交叉口配时优化仿真与设计[J]].国外电子测量技术,2020,39(6):76-80.

[3] 徐东伟,周磊,王达,等.基于深度强化学习的城市交通信号控制综述[J].交通运输工程与信息学报,2022,20(1):15-30.

[4] 陆丽萍,程垦,褚端峰,等.基于竞争循环双 Q 网络的自适应交通信号控制[J].中国公路学报,2022,35(8):267-277.

[5] 龙顺忠,王志建,刘皓.网联环境下基于深度强化学习的单路口交通信号控制优化[J].工业控制计算机,2020,33(10):16-19.

[6] LIANG X Y,DU X S,WANG G L,et al.A Deep Reinforcement Learning Network for Traffie Light Cycle Control [J].IEEE Transactions on Vehicular Technology,2019,68(2):1243-1253.

[7] 赖建辉.基于D3QN的交通信号控制策略[J].计算机科学,2019,46(增刊2):117-121.

[8] 张国有,宋世峰.基于D3QN的交通灯控制优化[J].计算机与现代化,2023(7):30-35.

[9] 张玺君,聂生元,李喆,等.基于自注意力机制的深度强化学习交通信号控制[J].交通运输系统工程与信息,2024,24(2):96-104.

[10] 刘志,曹诗鹏,沈阳,等.基于改进深度强化学习方法的单交叉口信号控制[J].计算机科学,2020,47(12):226-232.

[11] 周瑶瑶,李烨.基于排序优先经验回放的竞争深度Q 网络学习[J].计算机应用研究,2020,37(2):486-488.

[12] 刘森,李玺,黄运.基于改进DQN算法的NPC行进路线规划研究[J].无线电工程,2022,52(8):1441-1446.

[13] XIE Z Z,HU L,HUANG Y,et al.A Semiopportunistic Task Allocation Framework for Mobile Crowdsensing with Deep Learning[J].Wireless Communications and Mobile Computing,2021,2021:1-15.

[14] 孟水仙,刘艳超,王树彬.基于多智能体深度强化学习的车联网资源分配方法[J].无线电工程,2024,54(6):1388-1397.

[15] ZHANG Y F,ZOU Y H,ZHAO X D.Manufacturing Resource Scheduling Based on Deep Q-Network[J].Wuhan University Journal of Natural Sciences,2022,27(6):531-538.

[16] ZHANG X,LIU Y,XU X,et al.Structural Relational Inference Actor-Critic for Multi-agent Reinforcement Learning[J].Neurocomputing,2021,459:383-394.

[17] HUANG L B,QU X H.Improving Traffic Signal Control Operations Using Proximal Policy Optimization[J].IET Intelligent Transport Systems,2023,17(3):592-605.

[18] DUCROCQ R,FARHI N.Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection[J].International Journal of Intelligent Transportation Systems Research,2023,21:192-206.

基本信息:

中图分类号:U491.54

引用信息:

[1]金志琦,张正华,姜邦宇,等.基于改进D3QN的单点交叉口信号控制研究[J].无线电工程,2025,55(01):28-35.

基金信息:

江苏省扬州市产业前瞻与共性关键技术——产业前瞻研发重点资助项目(YZ2021016); 江苏省扬州市2021年市级计划——市校合作专项资助项目(YZ2021159)~~

发布时间:

2024-12-19

出版时间:

2024-12-19

网络发布时间:

2024-12-19

检 索 高级检索