Optimizing process allocation of parallel programs for heterogeneous clusters
Authors: Shuichi Ichikawa, Sho Takahashi, Yuu Kawai
Abstract: The performance of a conventional parallel application is often degraded by load‐imbalance on heterogeneous clusters. Although it is simple to invoke multiple processes on fast processing elements to alleviate load‐imbalance, the optimal process allocation is not obvious. Kishimoto and Ichikawa presented performance models for high‐performance Linpack (HPL), with which sub‐optimal configurations of heterogeneous clusters were estimated. Their results on HPL are encouraging, but their approach has not yet been verified with other applications. This study presents some enhancements of Kishimoto's scheme, which are evaluated with four typical scientific applications: computational fluid dynamics (CFD), finite‐element method (FEM), HPL (linear algebraic system), and fast Fourier transform (FFT). According to our experiments, our new models (NP‐T models) are superior to Kishimoto's models, particularly when the non‐negative least squares method is used for parameter extraction. The average errors of the derived models were 0.2% for the CFD benchmark, 2% for the FEM benchmark, 1% for HPL, and 28% for the FFT benchmark. This study also emphasizes the importance of predictability in clusters, listing practical examples derived from our study. Copyright © 2008 John Wiley & Sons, Ltd.
Keywords: heterogeneous cluster, high‐performance computing, performance evaluation, multiprocessing, optimization
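
The abstract names non‐negative least squares as the method used for parameter extraction but does not give the form of the NP‐T models. The following is only a minimal sketch of what such a fit might look like, under stated assumptions: the cost model T(N, P) = c0·N³/P + c1·N² + c2 is hypothetical (not the authors' model), the timing data are synthetic, and the helper names `solve_linear`, `nnls_small`, and `features` are invented for illustration. The solver enumerates candidate supports, which is practical only for a handful of unknowns; it stands in for a library routine such as `scipy.optimize.nnls`.

```python
from itertools import combinations


def solve_linear(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial
    pivoting. Returns None if the matrix is numerically singular."""
    n = len(b)
    m = [list(map(float, row)) + [float(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            return None
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def nnls_small(rows, y):
    """Minimize ||A x - y||^2 subject to x >= 0 for a few unknowns.

    Every subset of columns is tried as the set of non-zero coefficients;
    the ordinary least-squares fit on that subset is kept only if all of
    its coefficients are non-negative, and the best such fit wins.
    """
    k = len(rows[0])
    best_x = [0.0] * k
    best_rss = sum(v * v for v in y)  # residual of the all-zero solution
    for size in range(1, k + 1):
        for support in combinations(range(k), size):
            # Normal equations restricted to the chosen columns.
            ata = [[sum(r[i] * r[j] for r in rows) for j in support]
                   for i in support]
            aty = [sum(r[i] * v for r, v in zip(rows, y)) for i in support]
            sol = solve_linear(ata, aty)
            if sol is None or min(sol) < 0:
                continue
            x = [0.0] * k
            for idx, val in zip(support, sol):
                x[idx] = val
            rss = sum((sum(c * xi for c, xi in zip(r, x)) - v) ** 2
                      for r, v in zip(rows, y))
            if rss < best_rss:
                best_x, best_rss = x, rss
    return best_x, best_rss


def features(n, p):
    """Hypothetical model terms: parallel work, serial work, fixed overhead."""
    return [n ** 3 / p, n ** 2, 1.0]


# Synthetic, noise-free "measurements" generated from assumed coefficients.
TRUE_COEF = [2.0, 0.5, 1.0]
samples = [(n, p) for n in (1.0, 2.0, 3.0, 4.0) for p in (1, 2, 4)]
rows = [features(n, p) for n, p in samples]
times = [sum(c * f for c, f in zip(TRUE_COEF, row)) for row in rows]

coef, rss = nnls_small(rows, times)
print("fitted coefficients:", coef)
```

The non‐negativity constraint matters when an unconstrained fit would assign a negative cost to some term, which has no physical meaning for an execution‐time model; the constrained fit sets that coefficient to zero instead.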