首页 | 本学科首页   官方微博 | 高级检索  
     


A compiler for exploiting nested parallelism in OpenMP programs
Authors:Xinmin Tian  Jay P Hoeflinger  Grant Haab  Yen-Kuang Chen  Milind Girkar  Sanjiv Shah
Affiliation:aIntel Compiler Labs, Software and Solutions Group, Intel Corporation 3600 Juliette Lane, Santa Clara, CA 95052, USA;bApplication Research Lab, Core Technology Group, Intel Corporation 3600 Juliette Lane, Santa Clara, CA 95052, USA;cParallel & Distributed Solutions Division, Software and Solutions Group, Intel Corporation 1906 Fox Drive, Champaign, IL 61820, USA
Abstract:This paper presents the design and implementation of a parallelization framework and OpenMP runtime support in Intel® C++ & Fortran compilers for exploiting nested parallelism in applications using OpenMP pragmas or directives. We conduct the performance evaluation of two multimedia applications parallelized with OpenMP pragmas and compiled with the Intel C++ compiler on Hyper-Threading Technology (HT) enabled multiprocessor systems. The performance results show that the multithreaded code generated by the Intel compiler achieved a speedup up to 4.69 on 4 processors with HT enabled for five different input video sequences for the H.264 encoder workload, and a 1.28 speedup on an HT enabled single-CPU system and 1.99 speedup on an HT-enabled dual-CPU system for the audio–visual speech recognition workload. The performance gain due to exploiting nested parallelism for leveraging Hyper-Threading Technology is up to 70% for two multimedia workloads under different multiprocessor system configurations. These results demonstrate that hyper-threading benefits can be achieved by exploiting nested parallelism through Intel compiler and runtime system support for OpenMP programs.
Keywords:Compiler parallelization  Nested parallelism  OpenMP  Hyper-threading  Performance
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号