首页 | 本学科首页   官方微博 | 高级检索  
     

使用GPU加速分子动力学模拟中的非绑定力计算
引用本文:吴强,杨灿群,葛振,陈娟.使用GPU加速分子动力学模拟中的非绑定力计算[J].计算机工程与科学,2009,31(Z1).
作者姓名:吴强  杨灿群  葛振  陈娟
作者单位:国防科技大学计算机学院,湖南,长沙,410073
基金项目:国家863计划资助项目,国家自然科学基金资助项目 
摘    要:在分子动力学模拟(MD)中,对非绑定力的计算需要花费大量的时间。本文提出了基于CUDA和Brook+的两种双精度算法,分别在NVIDIA和AMD两款主流GPU上实现了非绑定力的计算,借助GPU的计算能力加速了整个MD程序。算法对MD进行了任务分割,采用区域分解的方法将非绑定力的计算映射到GPU的计算核心上,同时针对两款GPU的各自特点提出了线程块内共享存储、最小化数据集两种优化方法。性能测试结果表明,与Intel Xeon 2.6GHzCPU的单核相比,43.2万粒子的高速粒子碰撞模拟,在配置NVIDIA Tesla C1060的系统上性能提高了6.5倍,在配置AMD HD4870的系统上性能提高了4.8倍。

关 键 词:GPU  分子动力学模拟  CUDA  Brook+

GPU Acceleration of Nonbonded Forces for Molecular Dynamics Simulation
WU Qiang,YANG Can-qun,GE Zhen,CHENG Juan.GPU Acceleration of Nonbonded Forces for Molecular Dynamics Simulation[J].Computer Engineering & Science,2009,31(Z1).
Authors:WU Qiang  YANG Can-qun  GE Zhen  CHENG Juan
Abstract:The most costly computation required by molecular dynamics (MD) is the calculation of non-bonded forces. This paper presents two algorithms based on CUDA and Brook+ to accelerate the calculation of non-bonded forces and then examines them on two popular GPUs of NVIDIA and AMD. The algorithms enable fine-grained task decomposition and spatial decomposition of forces that map efficiently to the compute units of GPU. They also present optimization methods such as use of shared memory between threads and strip-mined. We then test our implementations for 432 thousand atoms. Our code executed on Tesla C1060 runs 6.5 times faster than the original application running on single core of Intel Intel Xeon 2.6GHZ CPU, while on HD4870 it runs 4.8 times faster.
Keywords:GPU  CUDA  Brook+
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号