首页 | 本学科首页   官方微博 | 高级检索  
     

基于安腾微处理器的程序性能优化与分析
引用本文:迟利华,刘杰. 基于安腾微处理器的程序性能优化与分析[J]. 计算机工程与科学, 2011, 33(9): 42. DOI: 10.3969/j.issn.1007-130X.2011.09.008
作者姓名:迟利华  刘杰
作者单位:国防科学技术大学计算机学院,湖南长沙,410073
基金项目:国家自然科学基金,国家863计划资助项目
摘    要:高性能计算越来越广泛地应用到科学和工程的各个领域,但实际应用程序获得的性能并未随着机器峰值性能的提高而同比例提高,应用程序只能发挥峰值性能的5%~10%左右,而且两者的差距在扩大,程序性能优化作为解决该问题的方法之一得到了学术界的广泛关注。本文基于安腾微处理器,总结了程序优化的通用方法,给出了程序优化与分析的一般步骤。根据优化与分析步骤,首先对四个程序进行了详细的性能分析,找到性能瓶颈和重点子程序;然后分别根据四个程序的特点,采用基于Cache和指令流水线的优化技术,对程序进行了性能优化;最后给出了性能优化测试结果,分别得到8%~33%的性能提高,取得了良好的优化效果。

关 键 词:安腾微处理器  程序性能优化  程序性能分析

Code Performance Optimization and Analysis Based on Itaniuam Microprocessors
CHI Li-hua,LIU Jie. Code Performance Optimization and Analysis Based on Itaniuam Microprocessors[J]. Computer Engineering & Science, 2011, 33(9): 42. DOI: 10.3969/j.issn.1007-130X.2011.09.008
Authors:CHI Li-hua  LIU Jie
Abstract:High performance computing is widely used in science and engineering to solve large scale computation problems.But the sustained performances achieved for the real applications do not increase as fast as the peak performances do.In fact,the sustained performance is a only about 5~10% of the peak performance,and the gap between the sustained performance and the peak performance is widening.Code performance optimization,which is one of the effective ways to solve this problem,draws the attentions of the research community.Based on Itanium microprocessors,this paper summarizes the general methods for code performance optimization and gives the common steps for code performance optimization and analysis.According to the steps,the performances for four codes are analysed in detail to find the performance bottlenecks and the key subroutine codes.Then four codes are optimized in the Itanium microprocesspor,using the code optimization techniques based on cache and instruction pipeline.Finally,the test results for the four performance optimization codes show that the performances are increased by 8~33% respectively.
Keywords:itaniuam microprocessor  code performance optimization  code performance analysis
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号