首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度强化学习的工业物联网多用户频谱分配
引用本文:邵瑞宇,黎智雄,任瑾璇.基于深度强化学习的工业物联网多用户频谱分配[J].电讯技术,2021,61(6):666-673.
作者姓名:邵瑞宇  黎智雄  任瑾璇
作者单位:广东工业大学 自动化学院,广州510006
基金项目:广东省信息物理融合系统重点实验室和智能制造信息物理融合系统集成技术国家地方联合工程研究中心开放课题(008)
摘    要:随着5G时代的来临,工业物联网将迎来蓬勃发展.然而,联网设备数量的不断增加,加剧了有限的频谱资源与大量的通信需求之间的矛盾.针对以上问题,提出了一种基于聚类分组和深度强化学习的合作式动态频谱分配算法,使用户可以获得较低的信息传输中断概率以及较少的多跳转发次数,快速找到信息传输的最优路径.在动态频谱分配中,该算法可以有效降低主、次用户信道接入的碰撞概率,提升频谱资源的利用率.对于少部分计算能力有限的用户,通过协调同组次用户的计算能力来完成策略的训练,实现了计算资源的高效利用.经过多次仿真实验验证,所提出的联合算法与现有的方法相比具有更高的信道利用率和更低的用户接入碰撞率.

关 键 词:工业物联网  认知无线电  深度强化学习  频谱分配  聚类分组

Multi-user spectrum allocation using deep reinforcement learning for industrial IoT
SHAO Ruiyu,LI Zhixiong,REN Jinxuan.Multi-user spectrum allocation using deep reinforcement learning for industrial IoT[J].Telecommunication Engineering,2021,61(6):666-673.
Authors:SHAO Ruiyu  LI Zhixiong  REN Jinxuan
Affiliation:School of Automation,Guangdong University of Technology,Guangzhou 510006,China
Abstract:With the advent of the 5G era,the technology of industrial Internet of Things(IIoT) is springing up vigorously.However,as the number of Internet devices is constantly rising,conflict between limited spectrum resources and high demand for communication is intensified.For the above problems,this paper proposes a dynamic grouping strategy based on a new grouping clustering mode and deep reinforcement learning(DRL) algorithm.The algorithm can relatively lower interruptive possibility of information transmission and multiple jumps for users who can efficiently find the optimal path of information transmission.Furthermore,in the process of dynamic spectrum allocation,the algorithm can effectively reduce the collision probability and promote the utilization of primary and secondary user of spectrum resources channel.Additionally,for a small portion of those users,with limited computing ability,they can complete the strategy training by cooperating with other users of the same groups,thus realizing the efficient utilization of computing resources.Numerous simulation experiments suggest that,compared with methods on hand,the proposed joint algorithm has the advantages over higher rate channel utilization and lower collision probability of user access.
Keywords:industrial Internet of Things  cognitive radio  deep reinforcement learning  spectrum allocation  cluster grouping
本文献已被 万方数据 等数据库收录!
点击此处可从《电讯技术》浏览原始摘要信息
点击此处可从《电讯技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号