首页 | 本学科首页   官方微博 | 高级检索  
     


A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms
Authors:S K S Gupta  C -H Huang  P Sadayappan  R W Johnson  
Affiliation:aSchool of Electrical Engineering and Computer Science, Ohio University, Athens, Ohio, 45701;bDepartment of Computer Science, Duke University, Durham, North Carolina, 27708-0129;cDepartment of Computer and Information Science, The Ohio State University, Columbus, Ohio, 43210;dDepartment of Computer Science, St. Cloud State University, St. Cloud, Minnesota, 56301
Abstract:A framework for synthesizing communication-efficient distributed-memory parallel programs for block recursive algorithms such as the fast Fourier transform (FFT) and Strassen's matrix multiplication is presented. This framework is based on an algebraic representation of the algorithms, which involves the tensor (Kronecker) product and other matrix operations. This representation is useful in analyzing the communication implications of computation partitioning and data distributions. The programs are synthesized under two different target program models. These two models are based on different ways of managing the distribution of data for optimizing communication. The first model uses point-to-point interprocessor communication primitives, whereas the second model uses data redistribution primitives involving collective all-to-many communication. These two program models are shown to be suitable for different ranges of problem size. The methodology is illustrated by synthesizing communication-efficient programs for the FFT. This framework has been incorporated into the EXTENT system for automatic generation of parallel/vector programs for block recursive algorithms.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号