首页 | 本学科首页   官方微博 | 高级检索  
     

基于Intel SIMD指令的二维FFT优化算法
引用本文:李成军,周卫峰,朱重光.基于Intel SIMD指令的二维FFT优化算法[J].计算机工程与应用,2007,43(5):41-44.
作者姓名:李成军  周卫峰  朱重光
作者单位:[1]中国科学院研究生院,北京100049 [2]中国科学院遥感应用研究所,北京100101
摘    要:在基于频域的大数据量图像处理算法中,最为耗时的步骤就是对图像数据进行二维FFT变换的过程。论文针对这一问题,提出一种基于Intel SIMD指令的二维FFT优化算法。通过将数据按照便于SIMD指令计算的方式进行组织,利用SSE3指令加速复数乘法,在二维处理中针对处理器缓存进行优化等方法,实现了很高的性能。实验结果表明:描述的算法比目前使用最广泛的公共域FFT程序包FFTW快30%左右。达到了对大数据量图像进行快速处理的要求,具有较大的工程实用价值。

关 键 词:大数据量图像处理  二维FFT  SIMD  SSE/SSE3
文章编号:1002-8331(2007)05-0041-04
修稿时间:2006-05

Optimal 2D FFT algorithm based on intel SIMD instructions
LI Cheng-jun,ZHOU Wei-feng,ZHU Chong-guang.Optimal 2D FFT algorithm based on intel SIMD instructions[J].Computer Engineering and Applications,2007,43(5):41-44.
Authors:LI Cheng-jun  ZHOU Wei-feng  ZHU Chong-guang
Affiliation:1.Graduate School of the Chinese Academy of Sciences,Beijing 100049,China; 2.Institute of Remote Sensing Application,Chinese Academy of Sciences,Beijing 100101,China
Abstract:In the large-scale image processing algorithms based on frequency domain method,the most time-consumlng part is playing 2D FFT on the image data.In this paper an optimal 2D FFT algorithm based on Intel SIMD technology is presented to solve this problem.Very high performance has been achieved by arranging data layout to benefit from SIMD instructions,using SSE3 instructions to accelerate complex number multiplications and optimizing the cache usage in the 2D case,etc.The result of the experiment demonstrates that the presented algorithm is about 30% faster than the abroad used public domain FFT package FFTW.The algorithm has reached the demand of fast large-scale image processing.
Keywords:large-scale image processing  2D FFT  SIMD  SSE/SSE3
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号