An FFT Performance Model for Optimizing General-Purpose Processor Architecture |
| |
Authors: | Ling Li Yun-Ji Chen Dao-Fu Liu Cheng Qian Wei-Wu Hu |
| |
Affiliation: | (1) Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China;(2) Loongson Technologies Corporation Limited, Beijing, 100190, China |
| |
Abstract: | General-purpose processor (GPP) is an important platform for fast Fourier transform (FFT), due to its flexibility, reliability and practicality. FFT is a representative application intensive in both computation and memory access, optimizing the FFT performance of a GPP also benefits the performances of many other applications. To facilitate the analysis of FFT, this paper proposes a theoretical model of the FFT processing. The model gives out a tight lower bound of the runtime of FFT on a GPP, and guides the architecture optimization for GPP as well. Based on the model, two theorems on optimization of architecture parameters are deduced, which refer to the lower bounds of register number and memory bandwidth. Experimental results on different processor architectures (including Intel Core i7 and Godson-3B) validate the performance model. |
| |
Keywords: | |
本文献已被 CNKI SpringerLink 等数据库收录! |
|