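Design of a LU Decomposition Accelerator Based on High-Precision Multiplying and Accumulating
Cite this article:LEI Yuan-wu, DOU Yong, GUO Song, LI Xin, LEI Guo-qing. Design of a LU Decomposition Accelerator Based on High-Precision Multiplying and Accumulating[J]. Computer Engineering & Science, 2009, 31(11). DOI: 10.3969/j.issn.1007-130X.2009.11.009
Authors:LEI Yuan-wu  DOU Yong  GUO Song  LI Xin  LEI Guo-qing
Affiliation (all five authors):School of Computer Science, National University of Defense Technology, Changsha, Hunan 410073
Funding:Supported by the National Natural Science Foundation of China
Abstract:This paper first analyzes how rounding errors accumulate during LU decomposition and builds a model relating precision loss to matrix size, in order to predict the accuracy of large-scale LU decomposition. Then, exploiting the fact that fixed-point addition is simple, fast, and free of precision loss, it designs a high-precision multiply-accumulator (HPMAcc) and, based on it, implements a fine-grained parallel LU decomposition accelerator. Experimental results show that, compared with the high-precision software libraries QD or MPFR, the LU decomposition accelerator with a 4-PE structure achieves a speedup of 100 times while delivering more than 90 bits of computational precision.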

Keywords:rounding error  LU decomposition  high-precision multiply-accumulate

Design of a LU Decomposition Accelerator Based on High-Precision Multiplying and Accumulating
LEI Yuan-wu,DOU Yong,GUO Song,LI Xin,LEI Guo-qing. Design of a LU Decomposition Accelerator Based on High-Precision Multiplying and Accumulating[J]. Computer Engineering & Science, 2009, 31(11). DOI: 10.3969/j.issn.1007-130X.2009.11.009
Authors:LEI Yuan-wu  DOU Yong  GUO Song  LI Xin  LEI Guo-qing
Abstract:In this paper we analyze how rounding errors accumulate during LU decomposition and build a model relating the loss of accuracy in the result to the size of the matrix, in order to predict the accuracy of large-scale LU decompositions. We then design a high-precision multiplying-accumulating (HPMAcc) unit that exploits the simplicity, speed, and error-free nature of fixed-point addition, and build a fine-grained parallel LU decomposition accelerator on top of this unit. Compared with implementations based on a high-precision software library such as QD or MPFR, the accelerator reaches speed-up factors above 100, while achieving more than 90 bits of accuracy.
Keywords:rounding error  LU decomposition  high-precision multiply and accumulate
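
The abstract's central observation, that fixed-point addition loses no precision, can be illustrated with a small software sketch. This is not the paper's HPMAcc hardware design: the function names, the accumulator width `FRAC_BITS`, and the use of Python's arbitrary-precision integers as a stand-in for a wide fixed-point register are all assumptions made for illustration. Each product of two doubles is converted exactly into a scaled integer, the integers are summed without rounding, and rounding happens only once at the end.

```python
from fractions import Fraction

# Hypothetical accumulator scale: enough fractional bits to hold the exact
# product of any two finite doubles (each has at most 1074 fractional bits).
FRAC_BITS = 2200


def exact_product_fixed(a: float, b: float) -> int:
    """Return a*b exactly, as an integer scaled by 2**FRAC_BITS."""
    na, da = a.as_integer_ratio()  # denominators are powers of two
    nb, db = b.as_integer_ratio()
    shift = (da.bit_length() - 1) + (db.bit_length() - 1)
    return (na * nb) << (FRAC_BITS - shift)


def hp_dot(xs, ys) -> Fraction:
    """Multiply-accumulate in fixed point: every addition is exact."""
    acc = 0
    for a, b in zip(xs, ys):
        acc += exact_product_fixed(a, b)
    return Fraction(acc, 1 << FRAC_BITS)


def naive_dot(xs, ys) -> float:
    """Ordinary double-precision accumulation, rounding after every step."""
    s = 0.0
    for a, b in zip(xs, ys):
        s += a * b
    return s


xs = [1e16, 1.0, -1e16]
ys = [1.0, 1.0, 1.0]
print(naive_dot(xs, ys))      # 0.0: the 1.0 is absorbed by 1e16 and lost
print(float(hp_dot(xs, ys)))  # 1.0: exact sum, rounded once at the end
```

Dot products of this kind sit at the core of the row updates in LU decomposition, which is why an accumulator that adds without rounding limits the growth of error as the matrix size increases.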
This article is indexed in Wanfang Data and other databases.