Resolving the GPU responsiveness dilemma through program transformations
Authors:Qi Zhu  Bo Wu  Xipeng Shen  Kai Shen  Li Shen  Zhiying Wang
Affiliation:1. National Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, China; 2. Jiangnan Institute of Computing Technology, Wuxi 214083, China; 3. Department of Computer Science, North Carolina State University, Raleigh NC 27695, USA; 4. EECS, Colorado School of Mines, Golden CO 80401, USA; 5. Department of Computer Science, University of Rochester, Rochester NY 14627, USA
Abstract:The emerging integrated CPU–GPU architectures facilitate short computational kernels to utilize GPU acceleration. Evidence has shown that, on such systems, the GPU control responsiveness (how soon the host program finds out about the completion of a GPU kernel) is essential for the overall performance. This study identifies the GPU responsiveness dilemma: host busy polling responds quickly, but at the expense of high energy consumption and interference with co-running CPU programs; interrupt-based notification minimizes energy and CPU interference costs, but suffers from substantial response delay. We present a program level solution that wakes up the host program in anticipation of GPU kernel completion. We systematically explore the design space of an anticipatory wakeup scheme through a timer-delayed wakeup or kernel splitting-based pre-completion notification. Experiments show that our proposed technique can achieve the best of both worlds, high responsiveness with low power and CPU costs, for a wide range of GPU workloads.
Keywords:program transformation  GPU  integrated architecture  responsiveness  
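
To make the timer-delayed wakeup idea from the abstract concrete, the sketch below models it on a virtual clock: the host sleeps until shortly before the predicted kernel completion, then busy polls until completion is observed. This is only an illustrative model, not the authors' implementation. The function name `anticipatory_wait`, its parameters, and the one-microsecond poll cost are all assumptions introduced here, and the kernel-splitting variant is not modeled.

```python
def anticipatory_wait(kernel_end_us, predicted_us, slack_us, poll_cost_us=1):
    """Simulate a timer-delayed wakeup (hypothetical model, integer microseconds).

    kernel_end_us: actual kernel completion time (unknown to a real host).
    predicted_us:  the host's estimate of the completion time.
    slack_us:      how early the host wakes up before the estimate.
    poll_cost_us:  assumed duration of one busy-poll iteration.

    Returns (response_delay_us, polls): how long after completion the host
    notices it, and how many poll iterations were spent (a proxy for CPU cost).
    """
    # Phase 1: sleep without using the CPU until just before the estimate.
    now = max(0, predicted_us - slack_us)
    polls = 0
    # Phase 2: busy poll; a poll observes completion if the kernel has
    # finished by the time that poll ends.
    while True:
        polls += 1
        now += poll_cost_us
        if now >= kernel_end_us:
            break
    return now - kernel_end_us, polls


# Accurate prediction with 50 us of slack: no delay, only 50 polls.
print(anticipatory_wait(1000, 1000, 50))   # (0, 50)
# Pure busy polling (wake immediately): no delay, but 1000 polls.
print(anticipatory_wait(1000, 0, 0))       # (0, 1000)
# Overestimated kernel time: one poll, but a 151 us response delay.
print(anticipatory_wait(1000, 1200, 50))   # (151, 1)
```

In this model, the slack trades polling cost against the risk of waking up late: a larger slack tolerates more prediction error at the price of more poll iterations.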
This article is indexed in SpringerLink and other databases.
Source journal: Frontiers of Computer Science