首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于剪切的SLP向量化方法*
引用本文:李颖颖,奚慧兴,高伟,李伟,翟胜伟. 一种基于剪切的SLP向量化方法*[J]. 计算机应用研究, 2018, 35(9)
作者姓名:李颖颖  奚慧兴  高伟  李伟  翟胜伟
作者单位:解放军信息工程大学,鞍山师范学院,解放军信息工程大学,中国电子科技集团公司第二十七研究所,中国电子科技集团公司第二十七研究所
基金项目:国家自然科学基金项目(面上项目,重点项目,重大项目);国家高科技发展规划项目(“863”计划)
摘    要:作为多媒体和科学计算等领域重要的程序加速器件之一,SIMD扩展部件现已广泛集成于各类处理器中。自动向量化方法是目前生成SIMD向量化程序的重要手段,超字并行SLP (Superword Level Parallelism)方法现已广泛应用于编译器中,并成为实现基本块级代码向量化的主要手段。SLP在进行收益评估时仅考虑代码段整体向量化的收益,并没有考虑到向量化收益为负的片段会降低最终整体的向量化收益,从而导致SLP方法无法达到最好的向量化效果。基于此,本文提出了一种基于剪切的SLP向量化方法(Throttling SLP,TSLP),通过寻找最优的向量化子图,去除了向量化收益为负的代码段,从而可以获得更好的向量化效果。通过标准测试程序的实验结果表明,与原来的SLP方法相比,TSLP方法平均能够获得9%的性能提升。

关 键 词:单指令多数据扩展部件;自动向量化;超字并行;代码模型
收稿时间:2017-04-18
修稿时间:2018-08-07

A SLP Vectorization Method based on Throttling *
LI Ying-Ying,XI Hui-Xing,GAO Wei,LI Wei and ZHAI Sheng-Wei. A SLP Vectorization Method based on Throttling *[J]. Application Research of Computers, 2018, 35(9)
Authors:LI Ying-Ying  XI Hui-Xing  GAO Wei  LI Wei  ZHAI Sheng-Wei
Affiliation:PLA Information Engineering University,,,,
Abstract:SIMD vectors are widely adopted in modern general purpose processors as they can boost performance and energy efficiency for media and scientific applications. Compiler-based automatic vectorization is one approach for generating code that makes efficient use of the SIMD units. The Superword-Level Parallelism (SLP) vectorization algorithm is the most well-known implementation of automatic vectorization. Choosing whether to vectorize is a one-off decision for the whole graph that has been generated. However, this is sub-optimal because the graph may contain code that is harmful to vectorization due to the need to move data from scalar registers into vectors. Therefore, a solution is proposed to overcome this limitation by introducing Throttling SLP (TSLP), a novel vectorization algorithm that finds the optimal graph to vectorize. The decision does not consider the potential benefits of throttling the graph by removing this harmful code. Our experiments show that TSLP can decrease execution time by 9% compared to SLP on average.
Keywords:SIMD extension  auto-vectorization  SLP  cost model
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号