(r∞, n1/2, s1/2) measurements on the 2-CPU CRAY X-MP
Authors: Roger W. Hockney
Affiliation: Computer Science Department, Reading University, Whiteknights, Reading, Berks, U.K.
Abstract: We report performance measurements made on the 2-CPU CRAY X-MP at ECMWF, Reading. Vector (SIMD) performance on one CPU is interpreted by the two parameters (r∞, n1/2), and we find for dyadic operations using FORTRAN r∞ = 70 Mflop/s, n1/2 = 53 flop. All-vector triadic operations produce r∞ = 107 Mflop/s, n1/2 = 45 flop; and a triadic operation with two vectors and one scalar gives r∞ = 148 Mflop/s and n1/2 = 60 flop. MIMD performance using both CPUs on one job is interpreted with the two parameters (r∞, s1/2), where s1/2 is the amount of arithmetic that could have been done during the time taken to synchronize the two CPUs. We find, for dyadic operations using the TSKSTART and TSKWAIT synchronization primitives, that r∞ = 130 Mflop/s and s1/2 = 5700 flop. This means that a job must contain more than about 6000 floating-point operations if it is to run at more than 50% of the maximum performance when split between both CPUs by this method. Less expensive synchronization methods using LOCKS and EVENTS reduce s1/2 to 4000 flop and 2000 flop respectively. A simplified form of LOCK synchronization written in CAL code further reduces s1/2 to 220 flop. This is probably the minimum possible value for synchronization overhead on the CRAY X-MP.
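The 50% claim above can be checked with a short calculation. The sketch below assumes the usual linear timing model associated with these parameters, in which a job of w flop takes t = (w + h) / r_max, where h is the half-performance parameter (vector length or synchronization cost, in flop) and r_max is the asymptotic rate. The abstract does not state this model explicitly, and the function names are illustrative only.

```python
def average_rate(work_flop, r_max_mflops, half_flop):
    """Average rate in Mflop/s for a job of `work_flop` operations.

    Assumed model (not stated in the abstract): t = (work + half) / r_max,
    so rate = work / t = r_max / (1 + half / work).
    """
    return r_max_mflops / (1.0 + half_flop / work_flop)


def work_for_fraction(fraction, half_flop):
    """Work (flop) needed to reach `fraction` of r_max, for 0 < fraction < 1.

    Obtained by solving fraction = 1 / (1 + half / work) for work.
    """
    return half_flop * fraction / (1.0 - fraction)


# TSKSTART/TSKWAIT figures quoted in the abstract: 130 Mflop/s, 5700 flop.
# A job of exactly 5700 flop runs at half the asymptotic rate.
print(average_rate(5700, 130.0, 5700))   # 65.0

# Work needed for 90% of the asymptotic rate under each synchronization method.
for name, half in [("TSKSTART/TSKWAIT", 5700), ("LOCKS", 4000),
                   ("EVENTS", 2000), ("CAL LOCK", 220)]:
    print(name, round(work_for_fraction(0.9, half)))
```

Under this model the half-performance parameter is exactly the job size at which half the asymptotic rate is reached, which is why a job split by TSKSTART/TSKWAIT needs roughly 6000 flop to exceed 50% of the maximum.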
Keywords: Performance measurements; MIMD computer; CRAY X-MP; synchronisation costs
This article is indexed in ScienceDirect and other databases.