
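Study on Optimization Techniques for Memory Accesses of CUDA Parallel Programs
Cite this article: Zou Yan, Yang Zhiyi, Zhang Kailong. Study on Optimization Techniques for Memory Accesses of CUDA Parallel Programs [J]. Computer Measurement & Control, 2009, 17(12): 2504-2506.
Authors: Zou Yan  Yang Zhiyi  Zhang Kailong
Affiliation: College of Computer Science, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China
Funding: National "Eleventh Five-Year Plan" Pre-Research Project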
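Abstract: This paper studies the Compute Unified Device Architecture (CUDA). It analyzes the notable features of the CUDA architecture and its memory access mechanism, and summarizes the memory access problems that commonly arise in CUDA parallel programs. For unaligned accesses to global memory and access conflicts in shared memory, it proposes corresponding memory access optimization strategies. Finally, the optimizations are tested with a histogram equalization algorithm, and program execution times before and after optimization are compared. The experimental results show that the optimizations greatly shorten the execution time of CUDA programs, and that the larger the image (in pixels), the greater the improvement.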
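The abstract names histogram equalization as the test workload but does not reproduce the algorithm. As a point of reference only, the following is a minimal sequential sketch in Python of the textbook method; it is not the authors' CUDA implementation. The function name `equalize` and the `levels` parameter are illustrative assumptions. In a GPU version, the histogram-building step is typically the one in which many threads update a shared set of bins, which is where memory access behavior tends to matter.

```python
def equalize(pixels, levels=256):
    """Histogram-equalize a flat list of integer gray levels in [0, levels).

    Sequential reference sketch (hypothetical helper, not from the paper).
    """
    if not pixels:
        return []

    # Step 1: count how many pixels fall on each gray level.
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    # Step 2: running sum of the histogram (cumulative distribution).
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    total = len(pixels)
    cdf_min = next(c for c in cdf if c > 0)  # CDF at the darkest level present
    if total == cdf_min:
        # Constant image: only one gray level occurs, nothing to stretch.
        return list(pixels)

    # Step 3: build a lookup table that spreads the CDF over the full range.
    scale = levels - 1
    lut = [max(0, round((c - cdf_min) * scale / (total - cdf_min))) for c in cdf]

    # Step 4: remap every pixel through the lookup table.
    return [lut[p] for p in pixels]
```

For example, `equalize([1, 1, 2, 2], levels=4)` stretches the two occupied middle levels to the ends of the range.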

Keywords: CUDA  parallel computing  optimization techniques

Study on Optimization Techniques for Memory Accesses of CUDA Parallel Programs
Zou Yan,Yang Zhiyi,Zhang Kailong.Study on Optimization Techniques for Memory Accesses of CUDA Parallel Programs[J].Computer Measurement & Control,2009,17(12):2504-2506.
Authors:Zou Yan  Yang Zhiyi  Zhang Kailong
Affiliation:(College of Computer Science,Northwestern Polytechnical University,Xi’an 710072,China)
Abstract: We analyze the distinctive features of CUDA (Compute Unified Device Architecture) and its memory access mechanism, summarize the typical memory access problems in CUDA parallel programs, and present optimization strategies targeting non-coalesced accesses to global memory and bank conflicts in shared memory. Using a histogram equalization algorithm as the test case, we compare the execution times of the original and optimized programs. The experimental results show that the larger the image (in pixels), the greater the improvement from optimization.
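To make the notion of a shared-memory bank conflict concrete, here is a small Python model of how word addresses map to banks. It is a simplified sketch under stated assumptions, not the authors' method: it assumes 16 banks of 32-bit words and 16 threads issuing a request together, as on CUDA hardware of that period, and it treats threads that read the same word as non-conflicting. The helper names `conflict_degree` and `column_access` are hypothetical. The abstract does not say which strategy the paper uses; padding each row by one word, shown here, is one widely used remedy.

```python
from collections import defaultdict


def conflict_degree(word_indices, num_banks=16):
    """Largest number of distinct words that land in the same bank.

    1 means conflict-free; n means the request is serialized n ways
    in this simplified model (assumed: bank = word index mod num_banks).
    """
    words_per_bank = defaultdict(set)
    for w in word_indices:
        words_per_bank[w % num_banks].add(w)
    return max((len(words) for words in words_per_bank.values()), default=0)


def column_access(row_width, column=0, threads=16):
    """Word indices touched when thread t reads element [t][column]
    of a row-major 2D tile whose rows are row_width words long."""
    return [t * row_width + column for t in range(threads)]


# A 16-word-wide tile read column-wise: every thread hits the same bank.
unpadded = conflict_degree(column_access(16))

# The same tile with one padding word per row: each thread hits its own bank.
padded = conflict_degree(column_access(17))
```

In this model `unpadded` is 16 and `padded` is 1, which is why a tile is often declared one element wider than it logically needs to be.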
Keywords: CUDA  parallel computing  optimization techniques
This article is indexed in CNKI, Wanfang Data, and other databases.