首页 | 本学科首页   官方微博 | 高级检索  
     


A Loop Transformation Algorithm for Communication Overlapping
Authors:Kazuaki Ishizaki  Hideaki Komatsu  Toshio Nakatani
Affiliation:(1) 1623-14, Shimotsuruma, Yamato-shi, Kanagawa-ken, 242-8502, Japan
Abstract:Overlapping communication with computation is a well-known approach to improving performance. Previous research has focused on optimizations performed by the programmer. This paper presents a compiler algorithm that automatically determines the appropriate loop indices of a given nested loop and applies loop interchange and tiling in order to overlap communication with computation. The algorithm avoids generating redundant communication by providing a framework for combining information on data dependence, communication, and reuse. It also describes a method of generating messages to exchange data between processors for tiled loops on distributed memory machines. The algorithm has been implemented in our High Performance Fortran (HPF) compiler, and experimental results have shown its effectiveness on distributed memory machines, such as the RISC System/6000 Scalable POWERparallel System. This paper also discusses the architectural problems of efficient optimization.
Keywords:compiler  distributed memory machines  overlapping communication with computation  loop transformations
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号