首页 | 本学科首页   官方微博 | 高级检索  
     

基于国产众核超级计算机的6×105核并行矩量法
引用本文:顾宗静,吴昊翔,赵勋旺,林中朝,张玉,张崎.基于国产众核超级计算机的6×105核并行矩量法[J].电子与信息学报,2019,41(4):845-850.
作者姓名:顾宗静  吴昊翔  赵勋旺  林中朝  张玉  张崎
作者单位:西安电子科技大学陕西省超大规模电磁计算重点实验室 西安 710071;西安电子科技大学陕西省超大规模电磁计算重点实验室 西安 710071;西安电子科技大学陕西省超大规模电磁计算重点实验室 西安 710071;西安电子科技大学陕西省超大规模电磁计算重点实验室 西安 710071;西安电子科技大学陕西省超大规模电磁计算重点实验室 西安 710071;西安电子科技大学陕西省超大规模电磁计算重点实验室 西安 710071
基金项目:国家重点研发计划;国家重点研发计划;中国博士后科学基金
摘    要:为实现电磁计算的安全可靠和自主可控,该文基于“天河二号”国产众核超级计算机平台,开展大规模并行矩量法(MoM)的开发工作。为减轻大规模并行计算时计算机集群的通信压力以及加速矩量法积分方程求解,通过分析矩量法电场积分方程离散生成的矩阵具有对角占优特性,提出一种新型LU分解算法,即对角块矩阵选主元LU分解(BDPLU)算法,该算法减少了panel列分解的计算量,更重要的是,完全消除了选主元过程的MPI通信开销。利用BDPLU算法,并行矩量法突破了6×105 CPU核并行规模,这是目前在国产超级计算平台上实现的最大规模的并行矩量法计算,其矩阵求解并行效率可达51.95%。数值结果表明,并行矩量法可准确高效地在国产超级计算平台上解决大规模电磁问题。

关 键 词:矩量法    LU分解    国产超级计算机    6×105
收稿时间:2018-06-04

Parallel MoM Using the Six Hundred Thousand Cores on Domestically-made and Many-core Supercomputer
Zongjing GU,Haoxiang WU,Xunwang ZHAO,Zhongchao LIN,Yu ZHANG,Qi ZHANG.Parallel MoM Using the Six Hundred Thousand Cores on Domestically-made and Many-core Supercomputer[J].Journal of Electronics & Information Technology,2019,41(4):845-850.
Authors:Zongjing GU  Haoxiang WU  Xunwang ZHAO  Zhongchao LIN  Yu ZHANG  Qi ZHANG
Affiliation:Shaanxi Key Laboratory of Large Scale Electromagnetic Computing, Xidian University, Xi’an 710077, China
Abstract:In order to realize safety, reliability and self-control of electromagnetic computing, the large-scale parallel MoM is studied based on domestically-made many-core supercomputer platform named " Tianhe-2”. A new LU decomposition algorithm named Block Diagonal matrix Pivoting LU decomposition (BDPLU) algorithm, is proposed by analyzing the diagonally dominant characteristics of the matrix generated through dispersing electric field integral equation of MoM, for the purpose of communication pressure reduction to computer cluster and solution acceleration to MoM integral equation during large-scale parallel computation. The BDPLU algorithm reduces the amount of calculation in the process of panel factorization. More importantly, the algorithm completely eliminates MPI communication when pivoting. Using BDPLU algorithm, the maximum number of CPU cores break through 6×105 CPU cores, which is the largest scale of parallel MoM computation in domestically-made and many-core supercomputing platform at present, and the parallel efficiency of solving matrix can reach 51.95%. Numerical results show that parallel MoM can accurately and efficiently solve large-scale electromagnetic field problems on domestic supercomputing platform.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号