Performance modeling of hyper-scale custom machine for the principal steps in block Wiedemann algorithm |
| |
Authors: | Tong Zhou, Jingfei Jiang |
| |
Affiliation: | 1. National University of Defense Technology, Changsha, China |
| |
Abstract: | Solving large-scale sparse linear systems over GF(2) plays a key role in fluid mechanics, simulation and design of materials, petroleum seismic data processing, numerical weather prediction, computational electromagnetics, and numerical simulation of nuclear explosions. Developing algorithms for this problem is therefore a significant research topic. In this paper, we propose a hyper-scale custom supercomputer architecture matched to the specific data features of the key procedure of the block Wiedemann algorithm, together with a parallel algorithm for that procedure on the custom machine. Four optimization strategies are proposed to improve computation, communication, and storage performance. We build a performance model to evaluate the execution performance and power consumption of the custom machine. The model shows that the optimization strategies yield a considerable speedup, running even three times faster than the fastest supercomputer, TH2, while consuming less power. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|