Performance Optimization of a Molecular Dynamics Code Based on Computational Caching

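Cite this article:	ZHANG Bao-yin, MO Ze-yao, CAO Xiao-lin. Performance Optimization of a Molecular Dynamics Code Based on Computational Caching[J]. Computer Engineering & Science, 2009, 31(11). DOI: 10.3969/j.issn.1007-130X.2009.11.020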

Author names:	ZHANG Bao-yin, MO Ze-yao, CAO Xiao-lin

Affiliation:	Institute of Applied Physics and Computational Mathematics, Beijing 100094, China (all three authors)

Funding:	Supported by the National Natural Science Foundation of China

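Abstract:	Molecular dynamics simulation codes often run at very low efficiency on modern high-performance computers, reaching only a few percent of the system's peak performance. This paper optimizes the performance of the parallel molecular dynamics program PMD3D on the Lenovo Shenteng 6800 supercomputer. Performance analysis shows that the mutually dependent floating-point operations in the particle interaction force calculation severely degrade the processor's instruction-level parallelism. We therefore apply computational caching: the large number of irregular floating-point computations are buffered and, once the buffer reaches a certain size, computed in vectorized form. After optimization, single-processor performance improves by more than a factor of 4, reaching 32.3% of the processor's 5.2 GFlops peak. Finally, a parallel performance test on 256 CPUs across 64 nodes of the Shenteng 6800 reaches 27% of the 1.3 TFlops peak.
Keywords:	molecular dynamics; performance optimization; computational caching; instruction-level parallelism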

ZHANG Bao-yin,MO Ze-yao,CAO Xiao-lin. Performance Optimization of a Molecular Dynamics Code Based on Computational Caching[J]. Computer Engineering & Science, 2009, 31(11). DOI: 10.3969/j.issn.1007-130X.2009.11.020

Authors:	ZHANG Bao-yin, MO Ze-yao, CAO Xiao-lin

Abstract:	Molecular dynamics codes typically run inefficiently, achieving only a few percent of peak performance on modern supercomputers. In this paper, we optimize the program PMD3D on the Shenteng 6800 supercomputer. Performance analysis shows that dependences among long-latency operations severely limit instruction-level parallelism. Based on this analysis, we optimize the code by caching the large number of irregular computations and then computing them in vectorized form. We achieve 32.3% of peak performance and a 4× speedup on the Itanium 2 processor. Finally, we test the program on the Shenteng 6800 with 256 CPUs in 64 nodes and achieve 27% of the 1.3 TFlops peak performance.

Keywords:	molecular dynamics; performance optimization; computational caching; ILP
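
The caching idea summarized in the abstract can be illustrated with a small sketch. This is not the PMD3D implementation: the function names (`forces_direct`, `forces_cached`, `flush`), the `batch` parameter, the Lennard-Jones potential in reduced units, and the all-pairs loop are all assumptions made for illustration. The direct version evaluates the long-latency force arithmetic inside the branchy pair loop; the cached version only records the pairs that pass the cutoff test and, once the buffer holds `batch` entries, evaluates the force factors in a separate loop whose iterations are independent of one another. In a compiled language, that second loop is the part a compiler could software-pipeline or vectorize; pure Python shows only the structure, not the speedup.

```python
import random


def lj_scalar(r2):
    """Lennard-Jones force factor F/r at squared distance r2 (epsilon = sigma = 1)."""
    inv_r2 = 1.0 / r2
    inv_r6 = inv_r2 * inv_r2 * inv_r2
    return 24.0 * inv_r2 * inv_r6 * (2.0 * inv_r6 - 1.0)


def forces_direct(pos, rc2):
    """Baseline: force arithmetic is done immediately inside the pair loop."""
    n = len(pos)
    f = [[0.0, 0.0, 0.0] for _ in range(n)]
    for i in range(n - 1):
        for j in range(i + 1, n):
            d = [pos[i][k] - pos[j][k] for k in range(3)]
            r2 = d[0] * d[0] + d[1] * d[1] + d[2] * d[2]
            if r2 < rc2:
                s = lj_scalar(r2)  # dependent divide/multiply chain behind a branch
                for k in range(3):
                    f[i][k] += s * d[k]
                    f[j][k] -= s * d[k]
    return f


def flush(buf, f):
    """Evaluate all buffered pairs, then scatter the results into the force array."""
    # Stage 1: independent arithmetic over the whole buffer (the vectorizable part).
    scale = [lj_scalar(r2) for (_, _, _, r2) in buf]
    # Stage 2: scatter the contributions back to both particles of each pair.
    for (i, j, d, _), s in zip(buf, scale):
        for k in range(3):
            f[i][k] += s * d[k]
            f[j][k] -= s * d[k]
    buf.clear()


def forces_cached(pos, rc2, batch=64):
    """Cached variant: the pair loop only records work; flush() does the arithmetic."""
    n = len(pos)
    f = [[0.0, 0.0, 0.0] for _ in range(n)]
    buf = []
    for i in range(n - 1):
        for j in range(i + 1, n):
            d = [pos[i][k] - pos[j][k] for k in range(3)]
            r2 = d[0] * d[0] + d[1] * d[1] + d[2] * d[2]
            if r2 < rc2:
                buf.append((i, j, d, r2))
                if len(buf) >= batch:
                    flush(buf, f)
    if buf:  # process whatever is left in the buffer
        flush(buf, f)
    return f


def jittered_lattice(n_side, spacing, seed=0):
    """Hypothetical test input: a cubic lattice with small random displacements."""
    rng = random.Random(seed)
    pts = []
    for ix in range(n_side):
        for iy in range(n_side):
            for iz in range(n_side):
                pts.append([(c + rng.uniform(-0.05, 0.05)) * spacing
                            for c in (ix, iy, iz)])
    return pts
```

Both functions visit the pairs in the same order and perform the same arithmetic per pair, so they should produce the same forces for any `batch` size; the buffer size only changes how much independent work is available to each flush.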
This article is indexed in Wanfang Data and other databases.
