首页 | 本学科首页   官方微博 | 高级检索  
     

TACLeBench中内核程序循环级推测并行性分析
引用本文:孟慧玲,王耀彬,李凌,杨洋,王欣夷,刘志勤.TACLeBench中内核程序循环级推测并行性分析[J].计算机应用,2021,41(9):2652-2657.
作者姓名:孟慧玲  王耀彬  李凌  杨洋  王欣夷  刘志勤
作者单位:1. 西南科技大学 计算机科学与技术学院, 绵阳 四川 621010;2. 四川省计算机研究院, 成都 610041
基金项目:国家自然科学基金面上项目(61672438)。
摘    要:线程级推测(TLS)技术可挖掘程序并行执行潜能,提高多核资源利用率,但目前TACLeBench的内核基准仍未在TLS并行化中得到有效分析。针对该问题设计了循环级推测执行的剖析方案和剖析工具。选取7个代表性的TACLeBench内核基准程序,首先对程序进行初始化分析,选取程序热点片段插入循环标识;其次对这些片段进行交叉编译,记录程序推测线程与内存地址相关数据,剖析其循环级最大潜在并行性;最后综合探讨程序运行时的特征(线程粒度、可并行化覆盖率、依赖特征)以及源码对加速比的影响。实验结果表明:1)该类程序适合采用TLS加速,与串行执行结果相比,循环结构的推测执行下的大部分程序的加速比在2以上,其中最高加速比达到20.79;2)利用TLS加速TACLeBench内核程序时,多数应用可有效利用4核到16核的计算资源。

关 键 词:线程级推测  多核  并行  TACLeBench  内核基准  
收稿时间:2020-11-17
修稿时间:2021-02-07

Loop-level speculative parallelism analysis of kernel program in TACLeBench
MENG Huiling,WANG Yaobin,LI Ling,YANG Yang,WANG Xinyi,LIU Zhiqin.Loop-level speculative parallelism analysis of kernel program in TACLeBench[J].journal of Computer Applications,2021,41(9):2652-2657.
Authors:MENG Huiling  WANG Yaobin  LI Ling  YANG Yang  WANG Xinyi  LIU Zhiqin
Affiliation:1. School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang Sichuan 621010, China;2. Sichuan Institute of Computer Sciences, Chengdu Sichuan 610041, China
Abstract:Thread-Level Speculation (TLS) technology can tap the parallel execution potential of programs and improve the utilization of multi-core resources. However, the current TACLeBench kernel benchmarks are not effectively analyzed in TLS parallelization. In response to this problem, the loop-level speculative execution analysis scheme and analysis tool were designed. With 7 representative TACLeBench kernel benchmarks selected, firstly, the initialization analysis was performed to the programs, the program hot fragments were selected to insert the loop identifier. Then, the cross-compilation was performed to these fragments, the program speculative thread and the memory address related data were recorded, and the maximun potential of the loop-level parallelism was analyzed. Finally, the program runtime characteristics (thread granularity, parallelizable coverage, dependency characteristics) and the impacts of the source code on the speedup ratio were comprehensively discussed. Experimental results show that:1) this type of programs is suitable for TLS acceleration, compared with serial execution results, under the loop structure speculative execution, the speedup ratios for most programs are above 2, and the highest speedup ratio in them can reach 20.79; 2) by using TLS to accelerate the TACLeBench kernel programs, most applications can effectively make use of 4-core to 16-core computing resources.
Keywords:Thread-Level Speculation (TLS)  multi-core  parallel  TACLeBench  kernel benchmark  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号