首页 | 本学科首页   官方微博 | 高级检索  
     

并行时空处理模型下的快速N-body算法
引用本文:王伟,曾栩鸿,王福焕,傅丽丽,曾国荪.并行时空处理模型下的快速N-body算法[J].计算机科学与探索,2011,5(11):1006-1013.
作者姓名:王伟  曾栩鸿  王福焕  傅丽丽  曾国荪
作者单位:同济大学计算机科学与技术系,上海200092;国家高性能计算机工程技术中心同济分中心,上海200092;同济大学嵌入式系统与服务计算教育部重点实验室,上海200092
基金项目:国家自然科学基金No.61103068,61174158; NSFC-微软亚洲研究院联合资助项目No.60970155; 教育部博士点基金No.20090072110035; 上海市优秀学科带头人计划项目No.10XD1404400; 高效能服务器和存储技术国家重点实验室开放基金No.2009HSSA06; 同济大学青年基金No.0800219105,2009KJ030~~
摘    要:图形处理器(graphic processing unit,GPU)的最新发展已经能够以低廉的成本提供高性能的通用计算。基于GPU的CUDA(compute unified device architecture)和OpenCL(open computing language)编程模型为程序员提供了充足的类似于C语言的应用程序接口(application programming interface,API),便于程序员发挥GPU的并行计算能力。采用图形硬件进行加速计算,通过一种新的GPU处理模型——并行时间空间模型,对现有GPU上的N-body实现进行了分析,从而提出了一种新的GPU上快速仿真N-body问题的算法,并在AMD的HD Radeon 5850上进行了实现。实验结果表明,相对于CPU上的实现,获得了400倍左右的加速;相对于已有GPU上的实现,也获得了2至5倍的加速。

关 键 词:N-body  并行计算  通用图形处理器(GPGPU)  时间空间模型
修稿时间: 

Parallel Time-Space Processing Model Based Fast N-body Simulation
WANG Wei,ZENG Xuhong,WANG Fuhuan,FU Lili,ZENG Guosun.Parallel Time-Space Processing Model Based Fast N-body Simulation[J].Journal of Frontier of Computer Science and Technology,2011,5(11):1006-1013.
Authors:WANG Wei  ZENG Xuhong  WANG Fuhuan  FU Lili  ZENG Guosun
Affiliation:1. Department of Computer Science and Engineering, Tongji University, Shanghai 200092, China 2. Tongji Branch, National Engineering & Technology Center of High Performance, Shanghai 200092, China 3. Key Laboratory of Embedded System and Service Computing, Ministry of Education, Tongji University, Shanghai 200092, China
Abstract:With the development of graphic processing unit (GPU), the general processing with high performance can be achieved with low cost. The GPU based compute unified device architecture (CUDA) and open computing language (OpenCL) programming model provide adequate application programming interfaces (APIs) similar to C language, which can be utilized by programmer with the power of GPU parallel processing. This paper presents a novel parallel implementation algorithm of N-body gravitational simulation on GPU. The algorithm uses graphics hardware to accelerate computation, and is optimized to N-body computation based on parallel time-space processing model (PTPM) on GPUs. The paper also analyzes the current implementations of GPU, and gives a new method on implementing N-body algorithm on HD Radeon 5850 GPU of AMD. Experimental results show that this method achieves an acceleration of 400 times compared with CPU, and an acceleration up to 2-5 times compared with GPU.
Keywords:N-body  parallel computing  general purpose graphic processing unit (GPGPU)  time-space model
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学与探索》浏览原始摘要信息
点击此处可从《计算机科学与探索》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号