Research on program optimization based on compute unified device architecture
Citation: YANG Yun-sheng, ZHANG Zhao-hui. Research on program optimization based on compute unified device architecture[J]. Information Technology (信息技术), 2011(12): 51-54, 84.
Authors: YANG Yun-sheng (杨云生)  ZHANG Zhao-hui (张朝晖)
Affiliation: 1. Navy Engineering University, Wuhan 430033, China; 2. Navy Command College, Nanjing 211800, China
Abstract: Compute Unified Device Architecture (CUDA) is a vital new force in general-purpose computing and the engine of the world's most powerful computer. Because of the particularities of its architecture, however, CUDA programs must be specifically optimized. To help programmers understand how to optimize CUDA programs, this paper sets out optimization measures in terms of programming methods, memory usage, and instruction-stream optimization, and compares them in tests on a worked example. The test results show that the fully optimized program runs 30 times faster than the unoptimized version. Finally, a reference ordering of the optimization measures is given.

Keywords: CUDA  program  optimization  signal processing
This article is indexed in CNKI, Wanfang Data, and other databases.