首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于SRT-8算法的SIMD浮点除法器的设计与实现
引用本文:邓子椰,陈书明,彭元喜,雷元武.一种基于SRT-8算法的SIMD浮点除法器的设计与实现[J].计算机工程与科学,2014,36(5):797-803.
作者姓名:邓子椰  陈书明  彭元喜  雷元武
摘    要:在科学计算、数字信号处理、通信和图像处理等应用中,除法运算是常用的基本操作之一。基于SRT 8除法算法,设计一个SIMD结构的IEEE 754标准浮点除法器,在同一硬件平台上能够实现双精度浮点除法和两个并行的单精度浮点除法。通过优化SRT 8迭代除法结构,提出商选择和余数加法的并行处理,并采用商数字存储技术降低迭代除法的计算延时,提高频率。同时,采用复用策略减少硬件资源开销,节省面积。实验表明,在40nm工艺下,本设计综合cell面积为18601.9681 μm2,运行频率可达2.5GHz,相对传统的SRT 8实现关键延迟减少了23.81%。

关 键 词:SRT-8  SIMD  浮点除法器  双精度浮点  SIMD单精度浮点  
收稿时间:2013-07-05
修稿时间:2014-05-25

Design and implementation of a SIMD floating-point divider based on SRT-8
DENG Zi ye,CHEN Shu ming,PENG Yuan xi,LEI Yuan wu.Design and implementation of a SIMD floating-point divider based on SRT-8[J].Computer Engineering & Science,2014,36(5):797-803.
Authors:DENG Zi ye  CHEN Shu ming  PENG Yuan xi  LEI Yuan wu
Affiliation:(College of Computer, National University of Defense Technology,Changsha 410073,China)
Abstract:In the area of scientific computing, digital signal processing, communication and image processing, division is one of the widely used basic operations. Based on SRT-8 algorithm, a SIMD floating-point divider is designed,which is compatible to IEEE-754 standard.The divider supports one double precision floating point division and two parallel single precision floating point division on the same hardware platform.It reduces the iterative division calculation time delay and improves the frequency by optimizing the SRT 8 iterative division structure,choosing parallel processing of quotient and residue addition,and adopting rapid storage technique. Besides,it reduces hardware resources and saves area by adopting reuse strategy.Experiments show that the synthesized cell area is 18 601.968 1μm2 and the frequency reaches up to 2.5GHz with 40nm technology library,and the latency of operation is reduced by 23.81% in comparison to the traditional implementation based on SRT-8.
Keywords:SRT-8  SIMD  floating-point division  double precision floating-point  SIMD single precision floating-point  
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号