首页 | 本学科首页   官方微博 | 高级检索  
     

Q学习算法在机会频谱接入信道选择中的应用
引用本文:赵,彪,李,鸥,栾红志.Q学习算法在机会频谱接入信道选择中的应用[J].信号处理,2014,30(3):298-305.
作者姓名:        栾红志
作者单位:信息工程大学信息系统工程学院
基金项目:国家科技重大专项(2008ZX03006);国家863计划(2012AA711)
摘    要:针对“先听后传”的机会频谱接入中认知用户的信道选择问题,本文提出了一种基于Q学习的信道选择算法。在非理想感知的条件下,通过建立认知用户的信道选择模型并设计恰当的奖励函数,使智能体能够与未知环境不断交互和学习,进而选择长期累积回报最大的信道接入。在学习过程中,本文引入了Boltzmann实验策略,运用模拟退火思想实现了资源探索与资源利用之间的折衷。仿真结果表明,所提算法能够在未知环境先验知识条件下可以快速选择性能较好的信道接入,有效提高认知用户的接入吞吐量和系统的平均容量。 

关 键 词:认知无线电    机会频谱接入    信道选择    Q学习
收稿时间:2013-09-30

Application of Q-Learning algorithm in channel selection for opportunistic spectrum access
Affiliation:Department of Information system Engineering, Information Engineering University
Abstract:Considering the problem of channel selection for opportunistic spectrum access (OSA), a Q Learning based channel selection scheme was proposed for OSA in this paper. A secondary user detected the channels licensed to some primary users periodically before it decided whether to transmit in the OSA system. Under imperfect sensing circumstances, the construction of channel selection model of the secondary user and the designation of an appropriate reward function play a significant role in the continuous interaction and learning between the agent and unknown environment, thus selecting the channel with the maximum cumulative reward. During the learning stage, a Boltzmann learning rule using simulated annealing ideas was employed to realize the tradeoff between channel exploration and exploitation. As the simulation results show, the proposed algorithm can get access to suitable channel, and raise the average system capacity and throughput of the secondary user effectively in the absence of prior knowledge on the channel environment. 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号