主成分线性回归模型分析应用程序性能 Principal Component Linear Regression Analysis on Performance of Applications期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

主成分线性回归模型分析应用程序性能

引用本文：	李胜梅,程步奇,高兴誉,乔林,汤志忠.主成分线性回归模型分析应用程序性能[J].计算机研究与发展,2009,46(11).

作者姓名：	李胜梅程步奇高兴誉乔林汤志忠

作者单位：	1. 清华大学计算机科学与技术系,北京,100084 2. 英特尔中国研究中心编程系统实验室,北京,100080 3. 中国科学院数学与系统科学研究院,北京,100190

基金项目：	国家自然科学基金项目，国家"八六三"高技术研究发展计划基金项目，国家"九七三"重点基础研究发展计划基金项目

摘要：	应用程序的性能分析能够给体系架构设计者和性能优化者提供有效的参考和指导.采用主成分线性回归模型分析了SPEC CPU2006的整型程序性能.模型选取性能监测单元采样到的事件为自变量,每条指令的时钟周期数(CPI)作为因变量.模型中采用主成分分析法消除了性能事件之间的相关性.实验结果表明,模型的拟合优度在90%以上,对性能进行预测的平均相对误差为15%.模型从量化上分析了L1,L2高速缓存缺失作为影响性能的关键因素是怎样影响程序性能的.
关键词：	性能分析 cache缺失主成分分析线性回归
Principal Component Linear Regression Analysis on Performance of Applications

Li Shengmei,Cheng buqi,Gao Xingyu,Qiao Lin,Tang Zhizhong.Principal Component Linear Regression Analysis on Performance of Applications[J].Journal of Computer Research and Development,2009,46(11).

Authors:	Li Shengmei Cheng buqi Gao Xingyu Qiao Lin Tang Zhizhong

Abstract:	The factors influencing application performance are various and the extents of influence are different. Analyzing and distinguishing the extents of influence caused by various factors can guide the architects in the architecture design and help programmers in the optimization. However, it is not easy to distinguish the extents of influence because the factors may correlate each other themselves. In this paper, a principal component linear regression model aiming at performance of SPEC CPU2006 integer benchmarks is set up. Cycles per instruction (CPI) is used to represent the application performance and the performance events monitored by performance monitor unit (PMU) are used to represent the influencing factors. Principal component analysis is implemented to eliminate the linear correlation among performance events. Then linear regression model is set up which uses CPI as the dependent variable and principal components as the independent variables. This model can analyze the influence on CPI caused by the performance events i. e. L1 data cache miss, L2 cache miss, DTLB miss, branch mis-prediction, micro-fusion, memory disambiguation events quantitatively. The model is validated by the t test and F test with goodness of fit over 90%. The average relative prediction error of the model is 15%. The results show quantitatively how L1 and L2 cache misses dominate the performance of the applications.

Keywords:	SPEC CPU2006
本文献已被万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏