Communication Benchmarking and Performance Modelling of MPI Programs on Cluster Computers

Authors:	D. A. Grove (duncan@cs.adelaide.edu.au), P. D. Coddington

Affiliation:	(1) School of Computer Science, University of Adelaide, Adelaide, SA, 5005, Australia

Abstract:	This paper gives an overview of two related tools that we have developed to provide more accurate measurement and modelling of the performance of message-passing communication and application programs on distributed memory parallel computers. MPIBench uses a very precise, globally synchronised clock to measure the performance of MPI communication routines. It can generate probability distributions of communication times, not just the average values produced by other MPI benchmarks. This allows useful insights to be made into the MPI communication performance of parallel computers, and in particular how performance is affected by network contention. The Performance Evaluating Virtual Parallel Machine (PEVPM) provides a simple, fast and accurate technique for modelling and predicting the performance of message-passing parallel programs. It uses a virtual parallel machine to simulate the execution of the parallel program. The effects of network contention can be accurately modelled by sampling from the probability distributions generated by MPIBench. These tools are particularly useful on clusters with commodity Ethernet networks, where relatively high latencies, network congestion and TCP problems can significantly affect communication performance, which is difficult to model accurately using other tools. Experiments with example parallel programs demonstrate that PEVPM gives accurate performance predictions on commodity clusters. We also show that modelling communication performance using average times rather than sampling from probability distributions can give misleading results, particularly for programs running on a large number of processors.
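
The abstract's closing point, that average times can mislead where sampling from distributions does not, can be illustrated with a toy Monte Carlo sketch. This is not PEVPM or MPIBench: the log-normal distribution, its parameters, and the function names below are assumptions chosen purely for illustration, and the "step" modelled here is simply one that finishes when the slowest of several messages arrives.

```python
import random
import statistics


def sample_comm_time(rng):
    """Draw one hypothetical message time in microseconds.

    Assumption: a log-normal shape with made-up parameters, standing in
    for a measured distribution such as one a benchmark might produce.
    """
    return rng.lognormvariate(5.0, 0.5)


def average_based_estimate(trials, rng):
    """Estimate a step's time as the mean time of a single message."""
    return statistics.fmean(sample_comm_time(rng) for _ in range(trials))


def predict_step_time(num_procs, trials, rng):
    """Estimate a step's time by sampling.

    The step is assumed to end only when the slowest of num_procs
    messages has arrived, so each trial takes the maximum of the draws.
    """
    return statistics.fmean(
        max(sample_comm_time(rng) for _ in range(num_procs))
        for _ in range(trials)
    )


if __name__ == "__main__":
    rng = random.Random(0)
    avg = average_based_estimate(10_000, rng)
    for procs in (2, 16, 128):
        sampled = predict_step_time(procs, 2_000, rng)
        print(f"{procs:4d} processes: average-based {avg:7.1f} us, "
              f"sampled {sampled:7.1f} us")
```

Under these assumptions the average-based figure stays fixed while the sampled estimate grows with the number of processes, because the chance that at least one message lands in the slow tail of the distribution rises as more messages are involved.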

Keywords:	parallel computing; cluster computing; performance modelling
This article is indexed in SpringerLink and other databases.
