基于秩的Q-路由选择算法 A Rank-based Q-routing Algorithm期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于秩的Q-路由选择算法

引用本文：	王月娟,张苏宁,吴水明,朱斐.基于秩的Q-路由选择算法[J].计算机与现代化,2018,0(10):1.

作者姓名：	王月娟 张苏宁 吴水明 朱斐

基金项目：	国家自然科学基金资助项目(61303108,61373094); 江苏省高校自然科学研究项目重大项目(17KJA520004); 苏州大学高校省级重点实验室项目(KJS1524)

摘要：	如何在动态变化的复杂网络中实现高效的路由选择是当前的研究热点之一。Q-学习是一种常用的强化学习算法，通过与环境的不断交互来解决未知环境中最优控制问题，能有效地完成在线式学习任务。本文提出一种基于秩的Q-路由选择(Rank-based Q-routing, RQ routing)算法。RQ routing算法在Q-学习的框架下，保留了Q-路由选择(Q-routing)算法的高效性，引入能动态计算的秩函数，用于表示当前状态在场景中的优先级，用以求解路由选择的最优解，避免等待队列过长，减少网络拥堵，提高传输速度。RQ routing算法中的秩函数具有灵活性，使用不同的秩函数即可满足各种场景的需求，保证了算法具有更好的泛化能力，克服了传统Q-routing应用场景单一的不足。实验验证了本文算法的有效性。
关键词：	强化学习 Q-学习 Q-路由选择 QoS路由计算机网络
收稿时间：	2018-10-26
A Rank-based Q-routing Algorithm

WANG Yue-juan,ZHANG Su-ning,WU Shui-ming,ZHU Fei.A Rank-based Q-routing Algorithm[J].Computer and Modernization,2018,0(10):1.

Authors:	WANG Yue-juan ZHANG Su-ning WU Shui-ming ZHU Fei

Abstract:	How to achieve efficient routing in the dynamical and complex network is one of current research hotspots. Q-learning, a frequently used reinforcement learning method, which can solve the optimal control problem in unknown environment by continuously interacting with the environment, is able to achieve on-line learning task. A rank-based Q-routing algorithm (RQ routing) is proposed. RQ routing algorithm, taking Q-learning algorithm as learning framework, and preserving the efficiency of the Q-routing algorithm, introduces the rank function that can be dynamically calculated to represent the priority of the current state in the scene, so as to solve the optimal solution of the route selection, which can avoid long waiting queue, reduce network congestion and improve the transmission speed. The rank function in the RQ routing algorithm is flexible. People can use different rank functions to meet the needs of various scenes, ensure the better generalization ability of the algorithm, and overcome the inflexibility of the traditional Q-routing application scene. The experiment verifies the effectiveness of the algorithm.

Keywords:	reinforcement learning Q-learning Q-routing QoS routing computer network

	点击此处可从《计算机与现代化》浏览原始摘要信息
	点击此处可从《计算机与现代化》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏