首页 | 本学科首页   官方微博 | 高级检索  
     

面向众核GPU加速系统的网络编码并行化及优化
引用本文:唐绍华.面向众核GPU加速系统的网络编码并行化及优化[J].计算机工程与应用,2014,50(21):79-84.
作者姓名:唐绍华
作者单位:湖南工程职业技术学院 信息工程系,长沙 410151
基金项目:国家高技术研究发展计划(863)(No.2012AA010905);国家自然科学基金(No.60803041,No.61070037);湖南省教育厅2012年度科技项目(No.12C1024)。
摘    要:网络编码允许网络节点在数据存储转发的基础上参与数据处理,已成为提高网络吞吐量、均衡网络负载和提高网络带宽利用率的有效方法,但是网络编码的计算复杂性严重影响了系统性能。基于众核GPU加速的系统可以充分利用众核GPU强大的计算能力和有效利用GPU的存储层次结构来优化加速网络编码。基于CUDA架构提出了以片段并行的技术来加速网络编码和基于纹理Cache的并行解码方法。利用提出的方法实现了线性随机编码,同时结合体系结构对其进行优化。实验结果显示,基于众核GPU的网络编码并行化技术是行之有效的,系统性能提升显著。

关 键 词:网络编码  图形处理器(GPU)  并行  计算统一设备架构(CUDA)  优化  

Parallelizing network coding on manycore GPU-accelerated system with optimization
TANG Shaohua.Parallelizing network coding on manycore GPU-accelerated system with optimization[J].Computer Engineering and Applications,2014,50(21):79-84.
Authors:TANG Shaohua
Affiliation:Department of Information Engineering, Hunan Engineering Polytechnic, Changsha 410151, China
Abstract:It is well known that network coding has emerged as a promising technique to improve network throughput, balance network loads as well as better utilization of the available bandwidth of networks, in which intermediate nodes are allowed to perform processing operations on the incoming packets other than forwarding packets. But, its potential for practical use has remained to be a challenge, due to its high computational complexity which also severely damages its performance. However, system accelerated by many-core GPU can advance network coding with powerful computing capacity and optimized memory hierarchy from GPU. A fragment-based parallel coding and texture-based parallel decoding are proposed on CUDA-enable GPU. Moreover, random linear coding is parallelizing using CUDA with optimization based on proposed techniques. Experimental results demonstrate a remarkable performance improvement, and prove that it is extraordinarily effective to parallelize network coding on many-core GPU-accelerated system.
Keywords:network coding  Graphic Processing Unit(GPU)  parallelizing  Compute Unified Device Architecture(CUDA)  optimization
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号