首页 | 本学科首页   官方微博 | 高级检索  
     

支持SIMD 与簇间双字传输体系下的VLIW DSP 分簇算法
引用本文:陈思灵,郑启龙,冯玉谦,付和萍.支持SIMD 与簇间双字传输体系下的VLIW DSP 分簇算法[J].计算机系统应用,2012,21(10):100-104.
作者姓名:陈思灵  郑启龙  冯玉谦  付和萍
作者单位:中国科学技术大学计算机科学与技术学院,合肥230026
基金项目:基金项目:核高基重大专项(2009ZX01034-001-001-002)
摘    要:VLIW DSP通过软件流水获得时间并行性,通过指令分簇获得空间并行性.指令的分簇本质上是资源分配问题.传统的指令分簇假设一条指令分到某一簇执行,而某些体系结构提供SIMD指令,传统的分簇算法对这类体系结构并不完全适用.提出的基于评估模型的分簇算法能对SIMD指令和普通指令进行合理的分簇.分簇之后,通过调度簇间传输指令,合成适当的簇间双字传输指令.由于SIMD和簇间双字传输的引入,以及较好的分簇决策,程序整体的调度延迟变短.对许多数字信号处理程序相对于没分簇的情况下的性能有2~3倍的性能提升,相对寄存器压力分簇算法有约7~10%性能的提升.

关 键 词:单指令多数据流  指令分簇  簇间双字传输指令  调度延迟  数据流图
收稿时间:2012/2/18 0:00:00
修稿时间:4/3/2012 12:00:00 AM

VLIW DSP Clustering Algorithm for Architecture Supporting SIMD and Inter-Cluster Double Word Transfer
CHEN Si-Ling,ZHENG Qi-Long,FENG Yu-Qian and FU He-Ping.VLIW DSP Clustering Algorithm for Architecture Supporting SIMD and Inter-Cluster Double Word Transfer[J].Computer Systems& Applications,2012,21(10):100-104.
Authors:CHEN Si-Ling  ZHENG Qi-Long  FENG Yu-Qian and FU He-Ping
Affiliation:(School of Computer Science and Technology, University of Science And Technology of China, Hefei 230039, China)
Abstract:VLIW DSP obtain time parallelism through software pipelining, and obtain space parallelism through instruction clustering. The essence of clustering is resource allocation. Traditional clustering assumes that one instruction assigns to certain cluster, but that does not applicable to some architecture offering SIMD instructions. This article proposes an algorithm based on evaluation model can do well with the problem of clustering for ordinary instructions and SIMD instructions. By scheduling inter-cluster transfer instruction, we synthesize inter-cluster double word transfer instruction. With the help of SIMD instruction, inter-cluster double word transfer instruction and good clustering policy decision, we make the schedule latency shorter. For many DSP programs, comparing with no clustering, we obtain 2 - 3 times increase in performance, comparing with clustering algorithm based on register allocation, we obtain 7-10% increase in performance.
Keywords:SIMD  instruction clustering  inter-cluster double word transfer instruction  scheduling delay  DFG
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号