首页 | 本学科首页   官方微博 | 高级检索  
     


Impact of Reverse Computing on Information Locality in Register Allocation for High Performance Computing
Authors:Mouad Bahi  Christine Eisenbeis
Affiliation:1. INRIA Saclay ?le-de-France, Orsay, France
2. LRI, Université de Paris-Sud 11, Orsay, France
3. LERMA, Observatoire de Paris, Paris, France
Abstract:Reversible computing aims at keeping all information on input and intermediate values available at any step of the computation, making information virtually present everywhere. Rematerialization in register allocation amounts to recomputing values instead of spilling them in memory when registers run out. In this paper we detail a heuristic algorithm for exploiting reverse computing for register materialization. This improves information locality as it provides more opportunities for retrieving data. Rematerialization adds instructions and we show on one specifically designed example that reverse computing may alleviate the impact of these additional instructions on performance. We also show how thread parallelism may be optimized on GPUs by performing register allocation with reverse recomputing that increases the number of threads per Streaming Multiprocessor. This is done on the main kernel of Lattice Quantum Chromo Dynamics simulation program where we gain a 11 % speedup.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号