基于深度强化学习的工业物联网多用户频谱分配 Multi-user spectrum allocation using deep reinforcement learning for industrial IoT期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于深度强化学习的工业物联网多用户频谱分配

引用本文：	邵瑞宇,黎智雄,任瑾璇.基于深度强化学习的工业物联网多用户频谱分配[J].电讯技术,2021,61(6):666-673.

作者姓名：	邵瑞宇黎智雄任瑾璇

作者单位：	广东工业大学自动化学院,广州510006

基金项目：	广东省信息物理融合系统重点实验室和智能制造信息物理融合系统集成技术国家地方联合工程研究中心开放课题(008)

摘要：	随着5G时代的来临,工业物联网将迎来蓬勃发展.然而,联网设备数量的不断增加,加剧了有限的频谱资源与大量的通信需求之间的矛盾.针对以上问题,提出了一种基于聚类分组和深度强化学习的合作式动态频谱分配算法,使用户可以获得较低的信息传输中断概率以及较少的多跳转发次数,快速找到信息传输的最优路径.在动态频谱分配中,该算法可以有效降低主、次用户信道接入的碰撞概率,提升频谱资源的利用率.对于少部分计算能力有限的用户,通过协调同组次用户的计算能力来完成策略的训练,实现了计算资源的高效利用.经过多次仿真实验验证,所提出的联合算法与现有的方法相比具有更高的信道利用率和更低的用户接入碰撞率.
关键词：	工业物联网认知无线电深度强化学习频谱分配聚类分组
Multi-user spectrum allocation using deep reinforcement learning for industrial IoT

SHAO Ruiyu,LI Zhixiong,REN Jinxuan.Multi-user spectrum allocation using deep reinforcement learning for industrial IoT[J].Telecommunication Engineering,2021,61(6):666-673.

Authors:	SHAO Ruiyu LI Zhixiong REN Jinxuan

Affiliation:	School of Automation,Guangdong University of Technology,Guangzhou 510006,China

Abstract:	With the advent of the 5G era,the technology of industrial Internet of Things(IIoT) is springing up vigorously.However,as the number of Internet devices is constantly rising,conflict between limited spectrum resources and high demand for communication is intensified.For the above problems,this paper proposes a dynamic grouping strategy based on a new grouping clustering mode and deep reinforcement learning(DRL) algorithm.The algorithm can relatively lower interruptive possibility of information transmission and multiple jumps for users who can efficiently find the optimal path of information transmission.Furthermore,in the process of dynamic spectrum allocation,the algorithm can effectively reduce the collision probability and promote the utilization of primary and secondary user of spectrum resources channel.Additionally,for a small portion of those users,with limited computing ability,they can complete the strategy training by cooperating with other users of the same groups,thus realizing the efficient utilization of computing resources.Numerous simulation experiments suggest that,compared with methods on hand,the proposed joint algorithm has the advantages over higher rate channel utilization and lower collision probability of user access.

Keywords:	industrial Internet of Things cognitive radio deep reinforcement learning spectrum allocation cluster grouping
本文献已被万方数据等数据库收录！
	点击此处可从《电讯技术》浏览原始摘要信息
	点击此处可从《电讯技术》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏