首页 | 本学科首页   官方微博 | 高级检索  
     

蒙特卡罗临界计算粒子并行动态负载均衡调权算法研究
引用本文:付元光,刘鹏,李瑞,王鑫,邓力.蒙特卡罗临界计算粒子并行动态负载均衡调权算法研究[J].原子能科学技术,2022,56(2):316-325.
作者姓名:付元光  刘鹏  李瑞  王鑫  邓力
作者单位:中国工程物理研究院 高性能数值模拟软件中心,北京100088;北京应用物理与计算数学研究所,北京100088
基金项目:国家自然科学基金(11805017,12001050);
摘    要:本文针对蒙特卡罗临界计算,提出了一种具有动态负载平衡能力的粒子权重调节算法。该算法在每次源迭代初始,将模拟粒子总权重均分到每个进程上,在保证每个进程本地权重恒定的前提下,分别调节不同进程的粒子出生权重,实现了源迭代过程中进程间计算的动态负载平衡。与Master Slave、Nearest Neighbor等算法进行比对,发现此算法加速效果更好。将此算法集成到JMCT程序中,通过BEAVRS基准题测试了并行效率,相对于128进程,4 800进程弱可扩和强可扩展并行效率分别可达92.54%和81.47%。

关 键 词:蒙特卡罗    临界计算    粒子并行    负载均衡

Study on Load Balanced Particle Weight Adjustment Algorithm for Particle Parallel Monte Carlo Criticality Calculation
FU Yuanguang,LIU Peng,LI Rui,WANG Xin,DENG Li.Study on Load Balanced Particle Weight Adjustment Algorithm for Particle Parallel Monte Carlo Criticality Calculation[J].Atomic Energy Science and Technology,2022,56(2):316-325.
Authors:FU Yuanguang  LIU Peng  LI Rui  WANG Xin  DENG Li
Affiliation:Software Center for High Performance Numerical Simulation, China Academy of Engineering Physics, Beijing 100088, China;Institute of Applied Physics and Computational Mathematics, Beijing 100088, China
Abstract:During the source iteration process of Monte Carlo criticality calculation with particle parallel, there exists stochastic fluctuate which may cause unbalanced allocation of number of fission banks. Without inducing bank adjustment algorithm, a fewer particle number per cycle with more cycles would lead to a poorer load balance. The Master Slave algorithm keeps one process to gather, reallocate and broadcast all fission banks from and to other running process at the end of each iteration cycle, which would guarantee an equal load allocation. However, large amount of data transmission and frequent operations of gather and broadcast would cause a low parallel efficiency. A poor acceleration ratio is found especially in the case with fewer particle numbers per cycle and more cycles for Master Slave. Instead, for the Nearest Neighbor algorithm, one process keeps a great proportion of particles locally, and only sends a small proportion to its nearest adjacent processes, which greatly reduce the amount of data to transfer. A better acceleration ratio was found compared with Master-Slave, which is not sensitive to different settings of particle and cycle number. In this work, a new load balanced algorithm was proposed. Instead of transmitting data among processes, this algorithm uses weight adjustment scheme. In this algorithm, total particle weight is split equally to each process at the beginning of simulation. The local total weight always keeps being constant as cycles proceed, but the local single particle born weight is adjusted based on the number of local fission banks in each process. As each process deals with equal particle weight, a good load balance can be achieved. As each process does operations separately, no fission bank data need to be transmitted, which leads to a good parallel efficiency. The defect is that it cannot keep consistency between serial and parallel results, which is not a severe case to stochastic simulation. A simple PWR pin cell problem and a 2×2 PWR assembly problem were used to test acceleration ability of different algorithm, with single process to 120 processes was used. It is found that new algorithm achieves a higher acceleration ratio compared to Master Slave and Nearest Neighbor in different settings of particle and cycle number. Further, BEAVRS whole core problem was used on Tianhe Ⅱ supercomputer to test the weak and strong scaling parallel efficiency of the new algorithm, with 9254% and 8147% respectively of 4 800 processes relative to 128 processes.
Keywords:Monte Carlo                                                                                                                        criticality calculation                                                                                                                        particle parallel                                                                                                                        load balance
点击此处可从《原子能科学技术》浏览原始摘要信息
点击此处可从《原子能科学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号