首页 | 本学科首页   官方微博 | 高级检索  
     

一种快速SIMD浮点乘加器的设计与实现
引用本文:吴铁彬,刘衡竹,杨惠,张剑锋,侯申. 一种快速SIMD浮点乘加器的设计与实现[J]. 计算机工程与科学, 2012, 34(1): 69-73
作者姓名:吴铁彬  刘衡竹  杨惠  张剑锋  侯申
作者单位:国防科学技术大学计算机学院,湖南长沙,410073
基金项目:核高基重大专项(2009ZX01034001006)
摘    要:本文设计和实现了5级全流水SIMD浮点乘加器,支持双精度和双单精度浮点乘法、乘累加(减)操作,用Modelsim和NC Verilog测试和验证了RTL代码实现,基于65nm工艺采用Synopsys公司的Design Complier工具综合硬件实现,运行频率可达714.286MHz。结果表明,相比文献[3]中经典的低延迟乘加结构,在相同综合条件下性能提升了17.89%,面积增加了6.61%,功耗降低了25.08%。

关 键 词:浮点乘法  浮点乘累加  SIMD  双单精度
收稿时间:2011-05-20
修稿时间:2011-10-26

Design and Implementation of a Fast SIMD Floating-Point Fused Multiply-Add Unit
WU Tie-bin , LIU Heng-zhu , YANG Hui , ZHANG Jian-feng , HOU Shen. Design and Implementation of a Fast SIMD Floating-Point Fused Multiply-Add Unit[J]. Computer Engineering & Science, 2012, 34(1): 69-73
Authors:WU Tie-bin    LIU Heng-zhu    YANG Hui    ZHANG Jian-feng    HOU Shen
Affiliation:(School of Computer Science,National University of Defense Technology,Changsha 410073,China)
Abstract:A new 5-stage pipelined architecture of floating-point fused multiply-add (FMAC) unit is proposed and implemented. In this architecture, double precision or double-single precision floating-point multiply,multiply-add and multiply-subtract operations are supported. The unit is implemented to RTL Code, and simulated and verified in Modelsim and NC Verilog. Further more, it is synthesized in the 65nm CMOS technology by Design Complier of Synopsys, and the frequency reaches 714.286MHz.In addition, compared with the conventional low-delay FMAC of paper [3] in the same environment, apart from 6.61 percent of area which could be acceptable is increased, 17.89 percent of delay and 25.08 percent of power is reduced.
Keywords:floating-point multiply  floating-point fused multiply-add (FMAC)  SIMD  double-single precision floating-point
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号