首页 | 本学科首页   官方微博 | 高级检索  
     

SIMD向量指令的非满载使用方法研究
引用本文:徐金龙 赵荣彩 赵 博. SIMD向量指令的非满载使用方法研究[J]. 计算机科学, 2015, 42(7): 229-233
作者姓名:徐金龙 赵荣彩 赵 博
作者单位:信息工程大学数学工程与先进计算国家重点实验室 郑州450001
基金项目:本文受国家高技术研究发展计划(863)(2009AA01220),“核高基”重大专项(2009zx10036-001-001)资助
摘    要:大规模SIMD体系结构提供了更强的向量并行硬件支持,但是,大量迭代次数不足的循环由于不能提供足够的并行性,难以用等价的向量方式实现。为了更有效地利用SIMD,提出了一种非满载地使用SIMD指令的向量化方法。研究了向量寄存器的使用方式,基于非满载的向量寄存器使用方式实现了非满载的向量操作和短循环的向量化,并将非满载的向量化方法用于一般循环的向量化。提供了收益分析方法来为本向量化方法作精确指导。实验结果表明了该方法的有效性,所选测试用例的目标循环被向量化,平均加速比达到1.2。

关 键 词:大规模SIMD  并行  向量化  非满载向量操作  收益分析

Research on Non-full Length Usage of SIMD Vector Instruction
XU Jin-long ZHAO Rong-cai ZHAO Bo. Research on Non-full Length Usage of SIMD Vector Instruction[J]. Computer Science, 2015, 42(7): 229-233
Authors:XU Jin-long ZHAO Rong-cai ZHAO Bo
Affiliation:State Key Laboratory of Mathematical Engineering and Advanced Computing,University of Information Engineering,Zhengzhou 450001,China
Abstract:Large-scale SIMD architecture provides stronger vector parallel support on hardware.However,a large number of loops which are short of iterations can not provide sufficient parallelism,and it is difficult to achieve them with the equivalent vector mode.In order to make full use of SIMD,this paper presented a vectorization method which can use non-full length of SIMD vector instruction.This paper studied the vector register usage,achieved a non-full vector operation based on non-full length usage of vector register,which can vectorize short loops.Finally,this method was used to vectorize the common loops.Moreover,This paper provided a benefit analysis method to guide the vectorization method.Experimental results show that the method is available,the target loops of the selected test programs are vectorized and the average speedup is about 1.2.
Keywords:Large-scale SIMD  Parallel  Vectorization  Non-full vector operation  Benefit analysis
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号