Torus Ring: improving performance of interconnection network by modifying hierarchical ring
Affiliation:1. Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Ashrafi Esfahani Street, Poonak Sq., Tehran, Iran;2. Department of Computer Engineering, Sharif University of Technology, Tehran, Iran;3. School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran;4. Iran Telecommunication Research Center Institute, PO Box 3961-14155, Tehran, Iran;1. Embedded Systems Group, University of Stuttgart, Pfaffenwaldring 5b, Stuttgart 70569, Germany;2. Dept. of Electrical and Computer Engineering, University of Toronto, 10 King’s College Road, Toronto, ON M5S 3G4, Canada;3. Altium Europe GmbH, Philipp-Reis-Strasse 3, Karlsruhe 76137, Germany
Abstract:In multiprocessor systems, interconnection network design is critical to overall system performance. Unidirectional ring-based networks have been a popular choice for high-performance, large-scale shared-memory multiprocessor systems. In this paper, we propose the “Torus Ring”, a modified version of the two-level hierarchical ring. The Torus Ring has the same complexity as the hierarchical ring; the only difference is the way it connects the local rings. Compared to the hierarchical ring, the Torus Ring exploits the memory access locality of application programs more efficiently. It has an advantage over the hierarchical ring when the destination of a packet is in an adjacent local ring, especially the backward adjacent local ring. Under the assumption that packet destinations are uniformly distributed across the processing nodes, the average number of hops in the Torus Ring is equal to that of the hierarchical ring. With real parallel programs, however, the performance gain of the Torus Ring is expected to be larger, because their memory accesses exhibit locality. In the simulation results, the latency of the interconnection network is reduced by up to 19% and the execution time is reduced by up to 10%, at a moderate ring utilization ratio.
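To make the "backward adjacent local ring" remark concrete, the following is a minimal sketch of hop counting in a plain two-level unidirectional hierarchical ring. It does not model the Torus Ring itself, whose wiring the abstract does not specify. The model is an assumption made for illustration only: each local ring holds its processing nodes at positions 1..N plus one inter-ring interface at position 0, and the interfaces form a unidirectional global ring. The names `hier_ring_hops` and `average_hops` are hypothetical.

```python
from itertools import product


def hier_ring_hops(src, dst, num_rings, nodes_per_ring):
    """Hop count in a simplified two-level unidirectional hierarchical ring.

    Assumed model (not taken from the paper): nodes are (ring, position)
    pairs with position in 1..nodes_per_ring; position 0 of every local
    ring is its inter-ring interface, and the interfaces are linked by a
    unidirectional global ring.
    """
    (r1, p1), (r2, p2) = src, dst
    ring_len = nodes_per_ring + 1  # processing nodes plus the interface
    if r1 == r2:
        # Same local ring: travel forward only.
        return (p2 - p1) % ring_len
    to_interface = (0 - p1) % ring_len      # forward to the local interface
    global_hops = (r2 - r1) % num_rings     # forward along the global ring
    from_interface = p2                     # forward to the destination node
    return to_interface + global_hops + from_interface


def average_hops(num_rings, nodes_per_ring):
    """Average hops over all ordered source/destination pairs (uniform traffic)."""
    nodes = [(r, p) for r in range(num_rings)
             for p in range(1, nodes_per_ring + 1)]
    pairs = [(s, d) for s, d in product(nodes, nodes) if s != d]
    total = sum(hier_ring_hops(s, d, num_rings, nodes_per_ring)
                for s, d in pairs)
    return total / len(pairs)


if __name__ == "__main__":
    # 4 local rings of 4 nodes each: forward vs. backward adjacent ring.
    print(hier_ring_hops((0, 1), (1, 1), 4, 4))  # forward neighbour: 6
    print(hier_ring_hops((1, 1), (0, 1), 4, 4))  # backward neighbour: 8
    print(average_hops(4, 4))
```

In this simplified model, a packet bound for the backward adjacent local ring has to travel almost all the way around the global ring, which is the case the abstract singles out as benefiting most from the Torus Ring.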
Keywords:
This article is indexed in ScienceDirect and other databases.