     

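Performance Optimization of Molecular Dynamics Simulation on Sunway TaihuLight System
Cite this article: TIAN Zhuo, CHEN Yi-Feng. Performance Optimization of Molecular Dynamics Simulation on Sunway TaihuLight System[J]. Journal of Software, 2021, 32(9): 2945-2962
Authors: TIAN Zhuo  CHEN Yi-Feng
Affiliation: School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China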
Funding: National Key Research and Development Program of China (2017YFB0202001); National Natural Science Foundation of China (61432018, 61672208)
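Abstract: The domestically developed "Sunway TaihuLight" supercomputer is suited to high-throughput computing; such systems tend to have long memory access latency and long network latency. In practice, a large class of problems are time-evolution simulations, which typically require high-frequency state iteration with communication in every iteration. A typical representative of such applications is molecular dynamics simulation: molecular properties depend on time evolution, so the state-dependent time dimension is difficult to parallelize. In practice, all-atom models need...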

Keywords: Sunway TaihuLight  molecular dynamics  iteration  heterogeneous  synchronization
Received: 2018-11-08
Revised: 2019-10-25

Performance Optimization of Molecular Dynamics Simulation on Sunway TaihuLight System
TIAN Zhuo, CHEN Yi-Feng. Performance Optimization of Molecular Dynamics Simulation on Sunway TaihuLight System[J]. Journal of Software, 2021, 32(9): 2945-2962
Authors: TIAN Zhuo  CHEN Yi-Feng
Affiliation:School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China
Abstract: The Sunway TaihuLight supercomputer is designed for high-throughput computing; such systems tend to have long memory access latency and long network latency. A large class of applications are time-to-solution problems, which require high-frequency iteration. A typical example is molecular dynamics simulation: each step depends on the state produced by the previous one, so the iteration is difficult to parallelize along the time dimension. The simulated time scale usually exceeds a microsecond, which means that the number of steps is more than 10^12. On a long-latency system, an effective simulation cannot be completed in a limited time. The main performance challenge on the long-latency Sunway system is therefore how to increase the iteration frequency. This study proposes a series of optimization strategies to do so: (1) reducing communication overhead and network contention through single-core communication combined with on-chip synchronization; (2) speeding up synchronization between cores by waiting on a shared-memory variable and synchronizing the computing processing elements; (3) reducing data dependencies by changing the computation pattern; (4) hiding memory access latency by overlapping computation and communication; (5) regularizing the data structures to improve data access.
Keywords:Sunway TaihuLight  molecular dynamics  iteration  heterogeneous  synchronization