ArchSim: A System-Level Parallel Simulation Platform for the Architecture Design of High Performance Computer |
| |
Authors: | Yong-Qin Huang Hong-Liang Li Xiang-Hui Xie Lei Qian Zi-Yu Hao Feng Guo Kun Zhang |
| |
Affiliation: | Jiangnan Institue of Computing Technology, Wuxi 214083, China |
| |
Abstract: | High performance computer (HPC) is a complex huge system, of which the architecture design meets increasing difficulties and
risks. Traditional methods, such as theoretical analysis, component-level simulation and sequential simulation, are not applicable
to system-level simulations of HPC systems. Even the parallel simulation using large-scale parallel machines also have many
difficulties in scalability, reliability, generality, as well as efficiency. According to the current needs of HPC architecture
design, this paper proposes a system-level parallel simulation platform: ArchSim. We first introduce the architecture of ArchSim
simulation platform which is composed of a global server (GS), local server agents (LSA) and entities. Secondly, we emphasize
some key techniques of ArchSim, including the synchronization protocol, the communication mechanism and the distributed checkpointing/restart
mechanism. We then make a synthesized test of some main performance indices of ArchSim with the phold benchmark and analyze
the extra overhead generated by ArchSim. Finally, based on ArchSim, we construct a parallel event-driven interconnection network
simulator and a system-level simulator for a small scale HPC system with 256 processors. The results of the performance test
and HPC system simulations demonstrate that ArchSim can achieve high speedup ratio and high scalability on parallel host machine
and support system-level simulations for the architecture design of HPC systems. |
| |
Keywords: | |
本文献已被 万方数据 SpringerLink 等数据库收录! |
| 点击此处可从《计算机科学技术学报》浏览原始摘要信息 |
|
点击此处可从《计算机科学技术学报》下载全文 |
|