Multi-Cell NOMA Energy Efficiency Optimization Power Allocation Algorithm Based on Deep Reinforcement Learning


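Cite this article:	HU Langtao, BI Songjiao, LIU Quanjin, WU Jianlan, YANG Rui. Multi-Cell NOMA Energy Efficiency Optimization Power Allocation Algorithm Based on Deep Reinforcement Learning[J]. Journal of University of Electronic Science and Technology of China (Natural Science Edition), 2022, 51(3): 384-391.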

Authors:	HU Langtao, BI Songjiao, LIU Quanjin, WU Jianlan, YANG Rui

Affiliation:	School of Electronic Engineering and Intelligent Manufacturing, Anqing Normal University, Anqing, Anhui 246133

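Funding:	National Natural Science Foundation of China (61603003, 62171002); Natural Science Foundation of the Anhui Provincial Department of Education (KJ2019A0554)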

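Abstract:	In downlink multi-cell non-orthogonal multiple access (NOMA) systems, power allocation is one of the key factors that determine system performance. Because the power optimization problem across cells is non-convex, the optimal power allocation is very difficult to obtain. This paper therefore proposes a power allocation algorithm based on deep reinforcement learning that maximizes energy efficiency: a deep Q-network serves as the action-state value function, and the system energy efficiency is used directly as the reward function, so that channel power allocation is optimized to maximize system energy efficiency. Simulation results show that the algorithm achieves higher system energy efficiency than the weighted minimum mean square error, fractional programming, maximum power, and random power algorithms, and that it also performs well in computational complexity, convergence speed, and stability.
Keywords:	deep Q-network; energy efficiency; non-orthogonal multiple access; power allocation; reinforcement learning
Received:	2021-07-21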
Multi-Cell NOMA Energy Efficiency Optimization Power Allocation Algorithm Based on Deep Reinforcement Learning

HU Langtao, BI Songjiao, LIU Quanjin, WU Jianlan, YANG Rui. Multi-Cell NOMA Energy Efficiency Optimization Power Allocation Algorithm Based on Deep Reinforcement Learning[J]. Journal of University of Electronic Science and Technology of China, 2022, 51(3): 384-391.

Authors:	HU Langtao, BI Songjiao, LIU Quanjin, WU Jianlan, YANG Rui

Affiliation:	School of Electronic Engineering and Intelligent Manufacturing, Anqing Normal University, Anqing, Anhui 246133

Abstract:	In a downlink multi-cell non-orthogonal multiple access (NOMA) system, power allocation is one of the key factors that determine system performance. Because the multi-cell power optimization problem is non-convex, the optimal power allocation is very difficult to obtain. This paper proposes a simple and efficient power allocation algorithm based on deep reinforcement learning that maximizes energy efficiency. The algorithm uses a deep Q-network as the action-state value function and sets the system energy efficiency directly as the reward function, so that channel power allocation is optimized to maximize system energy efficiency. Simulation results show that the proposed algorithm achieves higher system energy efficiency than the weighted minimum mean square error, fractional programming, maximum power, and random power algorithms. The scheme also shows better performance in computational complexity, convergence speed, and stability.
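
The abstract states that system energy efficiency is used directly as the reward, but it does not give the system model. The following is only a minimal sketch of what such a reward could look like, under assumptions that are not taken from the paper: two users per cell sharing one channel, the stronger user removing the weaker user's signal by successive interference cancellation, energy efficiency defined as sum rate over total transmit plus circuit power, and a discrete set of power levels as the action set. The names `energy_efficiency`, `power_levels`, `gains`, `noise`, and `circuit_power` are hypothetical.

```python
import math


def energy_efficiency(gains, powers, noise=1e-9, circuit_power=0.1):
    """Sum rate divided by total consumed power (bit/s/Hz per watt).

    gains[j][i][k]: channel gain from base station j to user k of cell i.
    powers[i][k]:   transmit power (W) base station i assigns to its user k.
    Assumption (not from the paper): two users per cell, user 0 is the
    weak user and user 1 is the strong user.
    """
    n_cells = len(powers)
    sum_rate = 0.0
    for i in range(n_cells):
        for k in range(2):
            # Inter-cell interference: total power of every other base station.
            inter = sum(gains[j][i][k] * sum(powers[j])
                        for j in range(n_cells) if j != i)
            # Intra-cell interference: the weak user (k=0) treats the strong
            # user's signal as noise; the strong user (k=1) is assumed to
            # cancel the weak user's signal via SIC.
            intra = gains[i][i][0] * powers[i][1] if k == 0 else 0.0
            sinr = gains[i][i][k] * powers[i][k] / (intra + inter + noise)
            sum_rate += math.log2(1.0 + sinr)
    total_power = sum(sum(p) for p in powers) + n_cells * circuit_power
    return sum_rate / total_power


def power_levels(p_max, n_levels):
    """Evenly spaced discrete power levels in (0, p_max] (assumed action set)."""
    return [p_max * (m + 1) / n_levels for m in range(n_levels)]
```

In a deep Q-network setup of the kind the abstract describes, an agent would pick one of the discrete levels per user, and the value returned by `energy_efficiency` would serve as the reward for that step. The network, state definition, and training loop are omitted here because the abstract does not specify them.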

This article is indexed in Wanfang Data and other databases.