A dissection solver with kernel detection for symmetric finite element matrices on shared memory computers |
| |
Authors: | A. Suzuki F.‐X. Roux |
| |
Affiliation: | 1. Laboratoire Jacques‐Louis Lions, Université Pierre et Marie Curie, , 75252 PARIS Cedex 05, France;2. ONERA, Chemin de la Hunière, , FR‐91761 PALAISEAU Cedex, France |
| |
Abstract: | A direct solver for symmetric sparse matrices from finite element problems is presented. The solver is supposed to work as a local solver of domain decomposition methods for hybrid parallelization on cluster systems of multi‐core CPUs, and then it is required to run on shared memory computers and to have an ability of kernel detection. Symmetric pivoting with a given threshold factorizes a matrix with a decomposition introduced by a nested bisection and selects suspicious null pivots from the threshold. The Schur complement constructed from the suspicious null pivots is examined by a factorization with 1 × 1 and 2 × 2 pivoting and by a robust kernel detection algorithm based on measurement of residuals with orthogonal projections onto supposed image spaces. A static data structure from the nested bisection and a block sub‐structure for Schur complements at all bisection levels can use level 3 BLAS routines efficiently. Asynchronous task execution for each block can reduce idle time of processors drastically, and as a result, the solver has high parallel efficiency. Competitive performance of the developed solver to Intel Pardiso on shared memory computers is shown by numerical experiments. Copyright © 2014 John Wiley & Sons, Ltd. |
| |
Keywords: | kernel detection finite element matrix nested dissection level 3 BLAS asynchronous task execution |
|
|