Tuning the Schur complement computations for finite element partitions
Authors: G. P. Nikishkov, H. Kanda, A. Makinouchi
Affiliation:

a Center for Aerospace Research and Education, University of California at Los Angeles, 7704 Boelter Hall, Los Angeles, CA 90095-1600, USA

b The University of Aizu, Department of Computer Software, Aizu-Wakamatsu City, Fukushima 965-8580, Japan

c Materials Fabrication Laboratory, Institute of Physical and Chemical Research—RIKEN, Wako, Saitama 351-01, Japan

Abstract: The domain decomposition method (DDM) is an efficient algorithmic tool for parallelizing finite element codes. One variant of the DDM, which uses a direct solution algorithm, is based on computing Schur complement matrices for the finite element partitions. This paper describes a simple technique that considerably improves the execution rate of the computationally intensive routines in the Schur complement computations. The technique uses ‘block of columns’ matrix operations and loop unrolling to reduce the number of load instructions from cache memory and to increase instruction-level parallelism. For superscalar RISC processors, experimental results show that the performance of the DDM solution procedure can be improved severalfold.
Keywords: Domain decomposition; Finite element method; Parallel; Schur complement
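
The 'block of columns' idea mentioned in the abstract can be illustrated with a small sketch. It is not the authors' routine: the function names (`update_simple`, `update_block4`), the block width of four columns, and the column-major storage with leading dimension `ld` are all assumptions made for illustration. The kernel shown is the column update y := y - sum_k c[k] * X[:,k], which is typical of the inner work in a direct elimination. The baseline version loads and stores every entry of y once per column of X, while the blocked version does so once per group of four columns and exposes four independent multiplications per iteration.

```c
#include <assert.h>
#include <stddef.h>

/* Baseline: subtract m scaled columns of X from y, one column at a time.
   X is stored column by column; column k starts at x + k * ld.
   Each y[i] is loaded and stored m times. */
void update_simple(double *y, const double *x, const double *c,
                   size_t n, size_t m, size_t ld)
{
    assert(ld >= n);
    for (size_t k = 0; k < m; ++k) {
        const double *xk = x + k * ld;
        const double ck = c[k];
        for (size_t i = 0; i < n; ++i)
            y[i] -= ck * xk[i];
    }
}

/* Blocked variant (illustrative block width of 4): the loop over columns
   is unrolled so that four columns are applied in a single pass over y.
   Each y[i] is then loaded and stored about m/4 times. */
void update_block4(double *y, const double *x, const double *c,
                   size_t n, size_t m, size_t ld)
{
    assert(ld >= n);
    size_t k = 0;
    for (; k + 4 <= m; k += 4) {
        const double *x0 = x + k * ld;
        const double *x1 = x0 + ld;
        const double *x2 = x1 + ld;
        const double *x3 = x2 + ld;
        const double c0 = c[k], c1 = c[k + 1], c2 = c[k + 2], c3 = c[k + 3];
        for (size_t i = 0; i < n; ++i)
            y[i] -= c0 * x0[i] + c1 * x1[i] + c2 * x2[i] + c3 * x3[i];
    }
    /* Remainder: fewer than four columns left, handled one at a time. */
    for (; k < m; ++k) {
        const double *xk = x + k * ld;
        const double ck = c[k];
        for (size_t i = 0; i < n; ++i)
            y[i] -= ck * xk[i];
    }
}
```

Both functions compute the same update up to floating-point rounding, since the blocked version sums the four products before subtracting. Any speedup depends on the processor, compiler, and cache behaviour, so the block width would need to be tuned by measurement.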
This article is indexed in ScienceDirect and other databases.