首页 | 本学科首页   官方微博 | 高级检索  
     


Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters
Authors:Chao-Tung Yang  Chih-Lin Huang  Cheng-Fang Lin
Affiliation:Department of Computer Science, Tunghai University, Taichung City, 40704, Taiwan
Abstract:Nowadays, NVIDIA's CUDA is a general purpose scalable parallel programming model for writing highly parallel applications. It provides several key abstractions – a hierarchy of thread blocks, shared memory, and barrier synchronization. This model has proven quite successful at programming multithreaded many core GPUs and scales transparently to hundreds of cores: scientists throughout industry and academia are already using CUDA to achieve dramatic speedups on production and research codes. In this paper, we propose a parallel programming approach using hybrid CUDA OpenMP, and MPI programming, which partition loop iterations according to the number of C1060 GPU nodes in a GPU cluster which consists of one C1060 and one S1070. Loop iterations assigned to one MPI process are processed in parallel by CUDA run by the processor cores in the same computational node.
Keywords:CUDA   GPU   MPI   OpenMP   Hybrid   Parallel programming
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号