首页 | 本学科首页   官方微博 | 高级检索  
     

Hetero-HPL:面向异构高性能计算机的HPL测试程序
引用本文:孙乔,孙家昶,马文静,赵玉文.Hetero-HPL:面向异构高性能计算机的HPL测试程序[J].软件学报,2020,31(7).
作者姓名:孙乔  孙家昶  马文静  赵玉文
作者单位:中国科学院软件研究所 并行软件与计算科学实验室, 北京 100190;中国科学院软件研究所 并行软件与计算科学实验室, 北京 100190;中国科学院软件研究所 计算机科学国家重点实验室, 北京 100190;中国科学院软件研究所 并行软件与计算科学实验室, 北京 100190;中国科学院大学, 北京 100049
基金项目:中国科学院战略性先导科技专项(XDC01030200);国家重点研发计划(2018YFB0204404)
摘    要:HPL(High Performance Linpack)是一套被广泛用于测评计算机性能的测试程序,几十年来学术界及产业界十分关注对HPL测试程序的定制化优化工作,以充分反应同时代新兴计算机平台的性能.面向当今主流多设备异构计算平台,本文尝试为HPL的优化工作提供一种新的解决方案:Hetero-HPL.在Hetero-HPL中,进程不再要求与(协)处理器一一对应,因此HPL算法在单节点独立运行情况下可以完全避免进程间数据传输开销,算法各个重要步骤有能力完全利用物理节点的所有资源,如内存容量,CPU核心,协处理器,PCI-e总线等.Hetero-HPL并不引入冗余计算量及通信量,并在任意设备数量下妥善应对锁页内存分配限制,确保多设备负载均衡和设备内的高效的大规模同质运算.在实验平台上,Hetero-HPL效率可以达到平台峰值性能的76.5%(其中矩阵乘函数效率为84%);进一步的实验表明,Hetero-HPL在多节点联机运行情况下也是一种可行的方案.

关 键 词:Hetero-HPL  多设备异构计算平台  并行计算
收稿时间:2019/8/22 0:00:00
修稿时间:2019/12/5 0:00:00

Hetero-HPL: A Novel Approach for HPL Benchmark on Heterogeneous Architectures
SUN Qiao,SUN Jia-Chang,MA Wen-Jing,ZHAO Yu-Wen.Hetero-HPL: A Novel Approach for HPL Benchmark on Heterogeneous Architectures[J].Journal of Software,2020,31(7).
Authors:SUN Qiao  SUN Jia-Chang  MA Wen-Jing  ZHAO Yu-Wen
Affiliation:Laboratory of Parallel Software and Computational Science, Institute of Software Chinese Academy of Sciences, Beijing 100190, China;Laboratory of Parallel Software and Computational Science, Institute of Software Chinese Academy of Sciences, Beijing 100190, China;State Key Laboratory of Computer Science, Institute of Software Chinese Academy of Sciences, Beijing 100190, China; Laboratory of Parallel Software and Computational Science, Institute of Software Chinese Academy of Sciences, Beijing 100190, China;University of Chinese Academy of Sciences, Beijing 100049, China
Abstract:HPL (HPL Performance Linpack) is a widely used benchmark for measuring computer performance.Over the decades,the practice of optimizing and tuning of HPL has constantly drawn great attention by both industry and academic cicle to evaluate the performace of contemporary cutting-edge computer platforms.For current heterogeneous HPC platforms with multiple accelerating co-processors,a new approach of high-performance HPL benchmark,Hetero-HPL,is proposed in this paper.In Hetero-HPL,the one-to-one match between process set and phsycal (co-)processor set becomes unnessary,so that the computation within each computing node may avoid inter-process message exchange,and each important procedure of the HPL algorithm may make full use of the hardware resourses of each computing node,such as memory,CPU cores,co-processors and PCI-e bus etc.Without redundant compution and communication,the working set of Hetero-HPL is not restricted by the limit of page-locked memory size in a single allocation,and is distributed in a way that the workload is balanced among all the co-processors and massive fine-grained parallelism can be exploited.On one experimental platform with four co-processors,Heter-HPL can reach an efficiency of 76.5% where the efficiency of dgemm is only 84% in one computing node,and further experiment suggests that Hetero-HPL is also a feasible approach in distributed environment.
Keywords:HPL  Multi-device heterogeneous platform  Parallel computing
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号