一种快速SIMD浮点乘加器的设计与实现 Design and Implementation of a Fast SIMD Floating-Point Fused Multiply-Add Unit期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种快速SIMD浮点乘加器的设计与实现

引用本文：	吴铁彬,刘衡竹,杨惠,张剑锋,侯申. 一种快速SIMD浮点乘加器的设计与实现[J]. 计算机工程与科学, 2012, 34(1): 69-73

作者姓名：	吴铁彬刘衡竹杨惠张剑锋侯申

作者单位：	国防科学技术大学计算机学院,湖南长沙,410073

基金项目：	核高基重大专项(2009ZX01034001006)

摘要：	本文设计和实现了5级全流水SIMD浮点乘加器,支持双精度和双单精度浮点乘法、乘累加(减)操作,用Modelsim和NC Verilog测试和验证了RTL代码实现,基于65nm工艺采用Synopsys公司的Design Complier工具综合硬件实现,运行频率可达714.286MHz。结果表明,相比文献[3]中经典的低延迟乘加结构,在相同综合条件下性能提升了17.89%,面积增加了6.61%,功耗降低了25.08%。
关键词：	浮点乘法浮点乘累加 SIMD 双单精度
收稿时间：	2011-05-20
修稿时间：	2011-10-26
Design and Implementation of a Fast SIMD Floating-Point Fused Multiply-Add Unit

WU Tie-bin , LIU Heng-zhu , YANG Hui , ZHANG Jian-feng , HOU Shen. Design and Implementation of a Fast SIMD Floating-Point Fused Multiply-Add Unit[J]. Computer Engineering & Science, 2012, 34(1): 69-73

Authors:	WU Tie-bin LIU Heng-zhu YANG Hui ZHANG Jian-feng HOU Shen

Affiliation:	(School of Computer Science,National University of Defense Technology,Changsha 410073,China)

Abstract:	A new 5-stage pipelined architecture of floating-point fused multiply-add (FMAC) unit is proposed and implemented. In this architecture, double precision or double-single precision floating-point multiply,multiply-add and multiply-subtract operations are supported. The unit is implemented to RTL Code, and simulated and verified in Modelsim and NC Verilog. Further more, it is synthesized in the 65nm CMOS technology by Design Complier of Synopsys, and the frequency reaches 714.286MHz.In addition, compared with the conventional low-delay FMAC of paper [3] in the same environment, apart from 6.61 percent of area which could be acceptable is increased, 17.89 percent of delay and 25.08 percent of power is reduced.

Keywords:	floating-point multiply floating-point fused multiply-add (FMAC) SIMD double-single precision floating-point
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《计算机工程与科学》浏览原始摘要信息
	点击此处可从《计算机工程与科学》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏