首页 | 本学科首页   官方微博 | 高级检索  
     


Parallel Multigrid for Anisotropic Elliptic Equations
Affiliation:1. Environmental and Fisheries Sciences Division, Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 2725 Montlake Blvd East, Seattle, WA 98112, USA;2. School of Aquatic and Fishery Sciences, University of Washington, 1122 NE Boat St, Seattle, WA 98105, USA;3. Center for Reproductive Biology, Washington State University, PO Box 647521, Pullman, WA 99164, USA;4. Ocean Associates Inc., Under Contract to Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 2725 Montlake Blvd East, Seattle, WA 98112, USA;1. Department of Mechanical and Aerospace Engineering, Seoul National University, Seoul 08826, South Korea;2. Institute of Advanced Machines and Design, Seoul National University, Seoul 08826, South Korea;1. Center for Research in Scientific Computation (CRSC) and Department of Mathematics, North Carolina State University, Raleigh, NC 27695, USA;2. Department of Applied Mathematics, National Chiao Tung University, Taiwan;3. School of Mathematical Sciences South China Normal University, Guangzhou 510631, China;4. School of Mathematical Sciences, Nanjing Normal University, Nanjing 210023, China
Abstract:In this paper two well-known robust multigrid solvers for anisotropic operators on structured grids are compared: alternating-plane smoothers combined with full coarsening and plane smoothers combined with semi-coarsening. The study has taken into account not only numerical properties but also architectural ones, focusing on cache memory exploitation and parallel characteristics. Experimental results for the sequential algorithms have been obtained on two different systems based on the MIPS R10000 processor, but with different L2 cache sizes and memory bandwidths (an SGI O2 workstation and an SGI Origin 2000 system). Although the alternating-plane approach is the best choice for sequential implementations, experimental estimations show poor parallel efficiencies. For the semicoarsening alternative two different parallel implementations have been considered. The first one has optimal parallel characteristics but due to deterioration of the convergence properties its realistic efficiency is not satisfactory. In the second one, some processors remain idle during a short period of time on every multigrid cycle. However, the second parallel algorithm is more efficient since it preserves the numerical properties of the sequential version. Parallel experiments have also been taken on a Cray T3E system.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号