
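Traffic Distribution Algorithm Based on Multi-Agent Reinforcement Learning
Cite this article: CHENG Chao, TENG Jun-jie, ZHAO Yan-ling, SONG Mei. Traffic Distribution Algorithm Based on Multi-Agent Reinforcement Learning[J]. Journal of Beijing University of Posts and Telecommunications, 2019, 42(6): 43-48, 57.
Authors: CHENG Chao  TENG Jun-jie  ZHAO Yan-ling  SONG Mei
Affiliations: School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876; China Financial Certification Authority, Beijing 100054; Instrumentation Technology and Economy Institute, Beijing 100055
Funding: National Key R&D Program of China (2018YFB1201500); National Natural Science Foundation of China (61871046); Beijing Natural Science Foundation (L171011); Beijing Major Special Project (Z181100003118012)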
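Abstract: Research on traditional traffic engineering strategies has mostly focused on constructing and solving mathematical models, whose computational complexity is too high. To address this, an experience-driven traffic allocation algorithm based on multi-agent reinforcement learning is proposed. The algorithm distributes traffic effectively over pre-computed paths without solving complex mathematical models, and thereby uses network resources efficiently and fully. It is trained centrally on the software-defined networking controller and, once training is complete, executed in a distributed manner on access switches or routers, which also avoids frequent interaction with the controller. Experimental results show that, compared with the shortest-path and equal-cost multi-path algorithms, the new algorithm effectively reduces the network's end-to-end delay and increases network throughput.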

Keywords: traffic engineering  multi-agent reinforcement learning  software-defined networking  delay  throughput
Received: 2019-07-10

Traffic Distribution Algorithm Based on Multi-Agent Reinforcement Learning
CHENG Chao, TENG Jun-jie, ZHAO Yan-ling, SONG Mei. Traffic Distribution Algorithm Based on Multi-Agent Reinforcement Learning[J]. Journal of Beijing University of Posts and Telecommunications, 2019, 42(6): 43-48, 57.
Authors:CHENG Chao  TENG Jun-jie  ZHAO Yan-ling  SONG Mei
Affiliation:1. School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China;
2. China Financial Certification Authority, Beijing 100054, China;
3. Instrumentation Technology and Economy Institute, Beijing 100055, China
Abstract:Most research on traditional traffic engineering strategies focuses on constructing and solving mathematical models, an approach whose computational complexity is too high. To address this, an experience-driven traffic allocation algorithm based on multi-agent reinforcement learning is proposed. It distributes traffic effectively over pre-calculated paths without solving complex mathematical models, and so makes full and efficient use of network resources. The algorithm is trained centrally on the software-defined networking controller and, once training is complete, is executed in a distributed way on access switches or routers, which also avoids frequent interaction with the controller. Experiments show that, compared with the shortest-path and equal-cost multi-path algorithms, the proposed algorithm reduces the end-to-end delay and increases the throughput of the network.
Keywords:traffic engineering  multi-agent reinforcement learning  software-defined networking  delay  throughput  
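The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of splitting one demand across pre-computed paths. It assumes each access-node agent outputs one score per path and that a softmax turns the scores into split ratios. The function names, the scores, the path capacities, and the assumption that paths are disjoint with a single bottleneck each are all hypothetical and are not taken from the paper.

```python
import math

def ecmp_split(demand, num_paths):
    """Baseline: equal-cost multi-path splits the demand evenly."""
    return [demand / num_paths] * num_paths

def softmax(scores):
    """Numerically stable softmax: maps scores to ratios that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def agent_split(demand, path_scores):
    """Hypothetical agent action: one score per pre-computed path,
    normalized into split ratios and applied to the demand."""
    return [demand * r for r in softmax(path_scores)]

def max_link_utilization(flows, capacities):
    """Highest load/capacity ratio over the (assumed disjoint) paths."""
    return max(f / c for f, c in zip(flows, capacities))

# Assumed toy setting: three disjoint paths with unequal bottleneck capacity.
capacities = [10.0, 5.0, 5.0]
demand = 12.0

ecmp = ecmp_split(demand, len(capacities))
# Scores chosen by hand here; in a trained system they would come from
# the agent's policy given its local observation.
learned = agent_split(demand, [math.log(2), 0.0, 0.0])

print("ECMP max utilization:   ", max_link_utilization(ecmp, capacities))
print("Weighted max utilization:", max_link_utilization(learned, capacities))
```

In this toy setting the even split loads the two smaller paths more heavily than the capacity-aware split does, which is the kind of imbalance a learned split ratio is meant to avoid. Only the local computation at execution time is sketched here; centralized training on the controller is outside its scope.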
This article is indexed in Wanfang Data and other databases.