首页 | 本学科首页   官方微博 | 高级检索  
     

新型超字级并行改进算法
引用本文:张素平,韩林,丁丽丽,王鹏翔. 新型超字级并行改进算法[J]. 计算机应用, 2017, 37(2): 450-456. DOI: 10.11772/j.issn.1001-9081.2017.02.0450
作者姓名:张素平  韩林  丁丽丽  王鹏翔
作者单位:数学工程与先进计算国家重点实验室(信息工程大学), 郑州 450001
基金项目:“核高基”国家科技重大专项(2009ZX01036-001-001-2)。
摘    要:
对于超字级并行(SLP)算法不能有效地处理大型程序中并行代码率较小,且可向量化的代码中可能存在对向量化不利的代码的问题,提出了一种新型的SLP改进算法NSLPO。首先,将程序中不能向量化的非同构语句进行同构化处理,定位SLP丢失的向量化机会;然后,通过冗余节点添加构建最大通用子图,通过冗余删除等优化过程得到同构化之后的补充SLP图,提高程序中代码的并行性;最后,运用节流法将对向量化有害的代码摒除在向量化之外,仅对它们进行标量处理,通过只向量化处理那些向量化有收益的代码以尽可能地提升程序效率。在一组广泛使用的内核测试集中进行实验,结果显示,与SLP算法相比,NSLPO算法性能更优,其执行时间比SLP平均减少9.1%。

关 键 词:同构  节流法  向量化  超字级并行  补充图  
收稿时间:2016-07-31
修稿时间:2016-10-24

New improved algorithm for superword level parallelism
ZHANG Suping,HAN Lin,DING Lili,WANG Pengxiang. New improved algorithm for superword level parallelism[J]. Journal of Computer Applications, 2017, 37(2): 450-456. DOI: 10.11772/j.issn.1001-9081.2017.02.0450
Authors:ZHANG Suping  HAN Lin  DING Lili  WANG Pengxiang
Affiliation:State Key Laboratory of Mathematical Engineering and Advanced Computing(Information Engineering University), Zhengzhou Henan 450001, China
Abstract:
For SLP (Superword Level Parallelism) algorithm cannot effectively process the large-scale applications covered with few parallel codes, and the codes which can be vectorized may be adverse to vectorization. A new improved algorithm for SLP was proposed, namely NSLPO. First of all, the non-isomorphic statements which cannot be vectorized were transformed to isomorphic statements as far as possible, thus locating the opportunities of vectorization which SLP has lost. Secondly, the Max Common Subgraph (MCS) was built by adding redundant nodes, and the supplement diagram of SLP was got by using some optimization such as redundancy deleting, which can greatly increase the parallelism of program. At last, the codes which are harmful to vectorization were exclued out of vectorization by using cutting method and executed in serial, only the valuable codes for vectorization were vectorized to improve the efficiency of programs as far as possible. Experiments were conducted on widely used kernel test sets. The experimental results show that compared with the SLP algorithm, the proposed NSLPO algorithm has better performance and its running time was reduced by 9.1%.
Keywords:isomorphism   cutting method   vectorization   Superword Level Parallelism (SLP)   supplement diagram
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号