A CPU-GPU hybrid approach for the unsymmetric multifrontal method |
| |
Authors: | Chenhan D YuWeichung Wang Dan’l Pierce |
| |
Affiliation: | a Department of Mathematics, National Taiwan University, Taipei 10617, Taiwan b MSC. Software Corporation, Glendale, CA 91203, USA |
| |
Abstract: | Multifrontal is an efficient direct method for solving large-scale sparse and unsymmetric linear systems. The method transforms a large sparse matrix factorization process into a sequence of factorizations involving smaller dense frontal matrices. Some of these dense operations can be accelerated by using a graphic processing unit (GPU). We analyze the unsymmetric multifrontal method from both an algorithmic and implementational perspective to see how a GPU, in particular the NVIDIA Tesla C2070, can be used to accelerate the computations. Our main accelerating strategies include (i) performing BLAS on both CPU and GPU, (ii) improving the communication efficiency between the CPU and GPU by using page-locked memory, zero-copy memory, and asynchronous memory copy, and (iii) a modified algorithm that reuses the memory between different GPU tasks and sets thresholds to determine whether certain tasks be performed on the GPU. The proposed acceleration strategies are implemented by modifying UMFPACK, which is an unsymmetric multifrontal linear system solver. Numerical results show that the CPU-GPU hybrid approach can accelerate the unsymmetric multifrontal solver, especially for computationally expensive problems. |
| |
Keywords: | Sparse and unsymmetric linear systems Multifrontal CPU-GPU hybrid approach Parallel computing |
本文献已被 ScienceDirect 等数据库收录! |
|