首页 | 本学科首页   官方微博 | 高级检索  
     

魂芯DSP上复数类型的支持和优化
引用本文:王玉林,郑启龙,赵高义.魂芯DSP上复数类型的支持和优化[J].计算机系统应用,2017,26(9):40-45.
作者姓名:王玉林  郑启龙  赵高义
作者单位:1. 中国科学技术大学 计算机科学与技术学院, 合肥 230027;2. 中国科学技术大学 安徽省高性能计算重点实验室, 合肥 230027,1. 中国科学技术大学 计算机科学与技术学院, 合肥 230027;2. 中国科学技术大学 安徽省高性能计算重点实验室, 合肥 230027,1. 中国科学技术大学 计算机科学与技术学院, 合肥 230027;2. 中国科学技术大学 安徽省高性能计算重点实验室, 合肥 230027
基金项目:“核高基”重大专项(2012ZX01034-001-001)
摘    要:魂芯DSP是一款采用VLIW和SIMD架构的针对高性能计算领域而设计的32bit静态标量数字信号处理器.为了满足数字高性能计算的性能要求,魂芯DSP提供了丰富的复数指令,而编译器不能直接利用这些复数指令来提升编译性能.因此针对魂芯DSP芯片提供了大量的复数类操作指令的特点,在传统开源编译器Open64的编译框架基础上进行研究,实现了复数作为编译器基础类型和复数运算操作的支持.同时,通过识别特定的复数类操作的模式利用魂芯DSP上的复数类指令对程序编译优化.实验结果表明,该实现方案在魂芯DSP编译器上对复数程序优化后能够取得平均5.28的加速比.

关 键 词:编译优化  分簇体系DSP  复数指令  Open64编译器
收稿时间:2016/12/28 0:00:00

Complex Data Type Support and Optimization for BWDSP
WANG Yu-Lin,ZHENG Qi-Long and ZHAO Gao-Yi.Complex Data Type Support and Optimization for BWDSP[J].Computer Systems& Applications,2017,26(9):40-45.
Authors:WANG Yu-Lin  ZHENG Qi-Long and ZHAO Gao-Yi
Affiliation:1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China;2. Anhui High Performance Computing Key Laboratory, University of Science and Technology of China, Hefei 230027, China,1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China;2. Anhui High Performance Computing Key Laboratory, University of Science and Technology of China, Hefei 230027, China and 1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China;2. Anhui High Performance Computing Key Laboratory, University of Science and Technology of China, Hefei 230027, China
Abstract:BWDSP is a 32bit static scalar digital signal processor with VLIW and SIMD features, which is designed for high performance computing. In order to meet the performance requirements of digital high-performance computing, the soul core DSP provides a rich set of complex instructions, and the compiler cannot directly use these complex instructions to improve the compilation performance. Since BWDSP has a wealth of complex type of instructions, and it has high performance demands in the radar digital signal field, the implementation is researched according to the characteristics of BWDSP features based on the traditional open-source Open64 compiler framework to achieve the complex data type and complex operations support operations, and further optimization of complex instruction is realized by identifying a specific type of complex operation of a series of patterns. The experimental results show that the implementation on BWDSP compiler can achieve 5.28 time performance improvement on average.
Keywords:compiler optimization  multi-cluster DSP  complex instructions  Open64 compiler
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号