首页 | 本学科首页   官方微博 | 高级检索  
     

非结构网格上求解中子输运方程的并行流水线Sn扫描算法
引用本文:莫则尧,傅连祥,阳述林. 非结构网格上求解中子输运方程的并行流水线Sn扫描算法[J]. 计算机学报, 2004, 27(5): 587-595
作者姓名:莫则尧  傅连祥  阳述林
作者单位:北京应用物理与计算数学研究所,北京,100088;北京应用物理与计算数学研究所,北京,100088;北京应用物理与计算数学研究所,北京,100088
基金项目:本课题得到国家自然科学基金(60273030)、国家"九七三"重点基础研究发展规划项目基金(G1999032805)和中物院重点基金资助.
摘    要:间断有限元离散纵标方法(Sn)是广泛应用于求解高维非定常中子输运方程的数值方法,它涉及几何网格空间、速度相空间和中子能群的离散,计算量很大.该文基于非结构网格,提出了基于区域分解的并行流水线Sn扫描算法,通过设计具有不同内在并行度和通信面体比的区域分解方法和队列插入算法,对两个不同物理模型,分别使用两台并行机的92个和256个CPU,获得72倍和78倍以上的加速.可扩展性能分析表明,算法的性能非常依赖于并行机的点对点通信延迟.

关 键 词:非结构网格  中子输运方程  并行流水线Sn扫描算法

Parallel Pipelined Sn Sweeping Algorithm for Neutron Transport on Unstructured Grid
MO Ze-Yao FU Lian-Xiang YANG Shu-Lin. Parallel Pipelined Sn Sweeping Algorithm for Neutron Transport on Unstructured Grid[J]. Chinese Journal of Computers, 2004, 27(5): 587-595
Authors:MO Ze-Yao FU Lian-Xiang YANG Shu-Lin
Abstract:Discontinuous finite element discrete ordinates numerical method is widely used to solve high dimensional time dependent neutron transport equation in recent years, and it needs huge calculations arising from discretizations of time, geometric space, velocity angle direction and neutron energy. However, the scalable parallel computing of such numerical method is challenging especially on unstructured grid because of the intrinsic data dependence of neutron flux refreshment on elements along each discrete velocity angle direction. For the realistic applications under cylinder symmetrical coordinate system, many related works have implied that domain decomposition is the unique strategy for scalable parallel computing of such numerical method. This paper presents a parallel pipelined S n sweeping algorithm with scalable performance on domain decomposition of unstructured grid. By well incorporating different domain decomposition methods with different priority queuing algorithms for ordering of internal grid elements in each decomposed subdomain, the MPI implementation of this algorithm has achieved speedup larger than 72 with 92 processors in a distributed shared memory parallel computer with network latency less than 2 microseconds and speedup larger than 77 with 256 processors in another MPP machine with network latency equal to 10 microseconds. Except from the numerical experiments, theoretical scalability analysis of this algorithm is also given in this paper, and it shows that the scalability of this algorithm heavily depends on the point-to-point network latency of parallel computers.
Keywords:unstructured grid  neutron transport  parallel pipelined S n sweeping algorithm  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号