首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度强化学习的多小区NOMA能效优化功率分配算法
引用本文:胡浪涛,毕松姣,刘全金,吴建岚,杨瑞.基于深度强化学习的多小区NOMA能效优化功率分配算法[J].电子科技大学学报(自然科学版),2022,51(3):384-391.
作者姓名:胡浪涛  毕松姣  刘全金  吴建岚  杨瑞
作者单位:安庆师范大学电子工程与智能制造学院 安徽 安庆 246133
基金项目:国家自然科学基金(61603003,62171002);;安徽省教育厅自然科学基金(KJ2019A0554);
摘    要:在下行多小区非正交多址接入系统中,功率分配是决定系统性能的关键因素之一。由于多小区系统间的功率优化问题的非凸性,获得最优功率分配在求解上非常困难。为此提出了一种基于深度强化学习最大化能效的功率分配算法,将深度Q网络作为动作?状态值函数,将系统能效直接设置为奖励函数,优化信道功率分配,使系统能量效率最大化。仿真结果表明,该算法比加权最小均方误差、分式规划、最大功率和随机功率算法等能够获得更高的系统能量效率,在算法计算复杂度、收敛速度和稳定性方面也有较好表现。

关 键 词:深度Q网络    能量效率    非正交多址接入    功率分配    强化学习
收稿时间:2021-07-21

Multi-Cell NOMA Energy Efficiency Optimization Power Allocation Algorithm Based on Deep Reinforcement Learning
HU Langtao,BI Songjiao,LIU Quanjin,WU Jianlan,YANG Rui.Multi-Cell NOMA Energy Efficiency Optimization Power Allocation Algorithm Based on Deep Reinforcement Learning[J].Journal of University of Electronic Science and Technology of China,2022,51(3):384-391.
Authors:HU Langtao  BI Songjiao  LIU Quanjin  WU Jianlan  YANG Rui
Affiliation:School of Electronic Engineering and Intelligent Manufacturing, Anqing Normal University Anqing Anhui 246133
Abstract:In a downlink multi-cell non-orthogonal multiple access system, power allocation is one of the key factors to determine system performance. Due to the non-convexity of the power optimization problem among multi-cell systems, it is very difficult to obtain the optimal power allocation. The power allocation algorithm based on deep reinforcement learning is proposed to maximize energy efficiency in this paper, which is simple and efficient. The algorithm takes the deep Q network as the action-state value function, system energy efficiency is directly set as a reward function, which optimizes channel power allocation and maximizes system energy efficiency. The simulation results show that the algorithm of proposed scheme is more effective than the weighted minimum mean square error, fractional programming, maximum power and random power algorithms in achieving higher system energy efficiency. The scheme also has better performances in algorithm calculation complexity, convergence speed and stability.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《电子科技大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《电子科技大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号