Linear pseudosystolic array for partitioned matrix algorithms |
| |
Authors: | Jaime H Moreno Miguel E Figueroa and Tomas Lang |
| |
Affiliation: | (1) Departamento de Ingeniería Eléctrica, Universidad de Concepción, Casilla 53-C, Concepción, Chile;(2) Departamento Arquitectura de Computadores, Univ. Politècnica de Catalunya, Pau Gargallo 5, 08028 Barcelona, Spain |
| |
Abstract: | We describe a class-specific linear pseudosystolic array, withK processing elements, suitable for partitioned execution of matrix algorithms. This array achieves high efficiency, exploits pipelining within cells in a simple manner, has off-cells communication rate lower than computation rate, a small storage inside each cell (whose size is independent of the size of problems), and external storage. This array has been derived from the application of the multimesh graph (MMG) method to a large class of matrix algorithms.Processing elements (cells) use the decoupled access/execute model of computation, which requires two programs in each cell: one controlling the execution of operations and the other the data transfers. All storage modules in the array are accessed as FIFO queues, without the need for addressing mechanisms. We describe the proposed instruction set, which includes single-instruction loops with no overhead, and block-loops with just one extra instruction. Moreover, cells can nest up to three loops with no added overhead. These features are needed for mapping algorithms with the MMG method.Mapping onto this array is illustrated using the LU-decomposition algorithm, and results obtained with other algorithms are also given. Estimates of performance indicate that it is possible to achieve over 85% efficiency, with low requirements in communication bandwidth and storage.This research has been supported in part by Universidad de Concepción (Grant DI-20.92.21, Linear array for matrix algorithms ) and by NSF (Grant MIP-8813340, Composite operations using on-line arithmetic in application-specific parallel architectures ). |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|