首页 | 本学科首页   官方微博 | 高级检索  
     

RSA算法的CUDA高效实现技术
引用本文:孙迎红,童元满,王志英.RSA算法的CUDA高效实现技术[J].计算机工程与应用,2011,47(2):84-87.
作者姓名:孙迎红  童元满  王志英
作者单位:1. 湖南涉外经济学院,计算机科学与技术系,长沙,410205
2. 国防科学技术大学,计算机学院,长沙,410073
基金项目:国家自然科学基金,湖南省教育厅资助项目
摘    要:CUDA(Compute Unified Device Architecture)作为一种支持GPU通用计算的新型计算架构,在大规模数据并行计算方面得到了广泛的应用。RSA算法是一种计算密集型的公钥密码算法,给出了基于CUDA的RSA算法并行化高效实现技术,其关键为引入大量独立并发的Montgomery模乘线程,并给出了具体的线程组织、数据存储结构以及基于共享内存的性能优化实现技术。根据RSA算法CUDA实现方法,在某款GPU上测试了RSA算法的运算性能和吞吐率。实验结果表明,与RSA算法的通用CPU实现方式相比,CUDA实现能够实现超过40倍的性能加速。

关 键 词:统一计算设备架构  RSA算法  Montgomery模乘
收稿时间:2009-9-14
修稿时间:2009-11-3  

CUDA based high performance implementation of RSA algorithm
SUN Yinghong,TONG Yuanman,WANG Zhiying.CUDA based high performance implementation of RSA algorithm[J].Computer Engineering and Applications,2011,47(2):84-87.
Authors:SUN Yinghong  TONG Yuanman  WANG Zhiying
Affiliation:1.Department of Computer Science and Technology,Hunan International Economics University,Changsha 410205,China 2.School of Computer Science,National University of Defense Technology,Changsha 410073,China
Abstract:As a new architecture supporting general purpose computing on GPU,Compute Unified Device Architecture(CU-DA) plays an important role in massive data parallel computing.RSA is a kind of computing concentrated public key cryptographic algorithm.To improve the performance of RSA algorithm,CUDA based high performance implementation is presented in this paper.The key of CUDA implementation of RSA is large amount of independent parallel Montgomery modular multiplication thread in the kernel side.The threads organization scheme and data structure of this implementation are also presented.Besides,shared memory based performance improvement method is also presented.According to the implementation method of this paper,the performance and throughput of RSA algorithm are obtained for a CUDA GPU.The experiment results show that the CUDA implementation can achieve more than 40 times speedup in comparison with general CPU implementation of RSA.
Keywords:Compute Unified Device Architecture(CUDA)  RSA algorithm  Montgomery module multiplication
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号