Discrete Fourier transform on multicore
Authors: Franchetti F, Püschel M, Voronenko Y, Chellappa S, Moura JMF
Affiliation: Carnegie Mellon University
Abstract: This article gives an overview of the techniques needed to implement the discrete Fourier transform (DFT) efficiently on current multicore systems. The focus is on Intel-compatible multicores, but we also discuss the IBM Cell and, briefly, graphics processing units (GPUs). The performance optimization is broken down into three key challenges: parallelization, vectorization, and memory hierarchy optimization. In each case, we use the Kronecker product formalism to derive the necessary algorithmic transformations from a few hardware parameters. Further code-level optimizations are discussed. The rigorous nature of this framework enables the complete automation of the implementation task, as shown by the program generator Spiral. Finally, we show and analyze DFT benchmarks of the fastest libraries available for the considered platforms.
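As an illustration of the Kronecker product formalism the abstract refers to, the following minimal sketch applies one Cooley-Tukey step written as DFT_mn = (DFT_m ⊗ I_n) T (I_m ⊗ DFT_n) L, where L is a stride permutation and T a diagonal matrix of twiddle factors. This is not code from the article or from Spiral; the function names `dft` and `cooley_tukey_step` are hypothetical, and the sketch assumes the forward transform with the exp(-2πi kl/N) sign convention.

```python
import cmath


def dft(x):
    """Naive O(n^2) DFT by definition (reference implementation)."""
    n = len(x)
    return [
        sum(x[l] * cmath.exp(-2j * cmath.pi * k * l / n) for l in range(n))
        for k in range(n)
    ]


def cooley_tukey_step(x, m, n):
    """Apply (DFT_m ⊗ I_n) T (I_m ⊗ DFT_n) L to x, with len(x) == m * n."""
    N = m * n
    assert len(x) == N

    # L: stride permutation. Gather m subsequences of length n,
    # each read from x at stride m, into contiguous blocks.
    z = [x[l1 + m * l2] for l1 in range(m) for l2 in range(n)]

    # I_m ⊗ DFT_n: m independent DFTs of size n on contiguous blocks.
    w = []
    for l1 in range(m):
        w.extend(dft(z[l1 * n:(l1 + 1) * n]))

    # T: diagonal twiddle factors exp(-2*pi*i * l1 * k2 / N).
    w = [
        w[l1 * n + k2] * cmath.exp(-2j * cmath.pi * l1 * k2 / N)
        for l1 in range(m)
        for k2 in range(n)
    ]

    # DFT_m ⊗ I_n: n independent DFTs of size m on elements at stride n.
    y = [0j] * N
    for k2 in range(n):
        col = dft(w[k2::n])
        for k1 in range(m):
            y[k1 * n + k2] = col[k1]
    return y


if __name__ == "__main__":
    x = [complex(i, 1 - i) for i in range(12)]
    err = max(abs(a - b) for a, b in zip(cooley_tukey_step(x, 3, 4), dft(x)))
    print("matches naive DFT:", err < 1e-9)
```

In this form the two tensor stages make the independent work visible: I_m ⊗ DFT_n is m separate sub-transforms on contiguous data, while DFT_m ⊗ I_n operates on n interleaved lanes at once. Structure of this kind is what a formal rewriting approach can match against hardware parameters such as core count or vector length.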
Keywords: