Exploiting the capabilities of modern GPUs for dense matrix computations
Authors: Sergio Barrachina, Maribel Castillo, Francisco D. Igual, Rafael Mayo, Enrique S. Quintana‐Ortí, Gregorio Quintana‐Ortí
Abstract: We present several algorithms to compute the solution of a linear system of equations on a graphics processor (GPU), as well as general techniques to improve their performance, such as padding and hybrid GPU‐CPU computation. We compare single and double precision performance of a modern GPU with unified architecture, and show how iterative refinement with mixed precision can be used to regain full accuracy in the solution of linear systems, exploiting the potential of the processor for single precision arithmetic. Experimental results on a GTX280 using CUBLAS 2.0, the implementation of BLAS for NVIDIA® GPUs with unified architecture, illustrate the performance of the different algorithms and techniques proposed. Copyright © 2009 John Wiley & Sons, Ltd.
Keywords: linear systems, graphics processors (GPUs), dense linear algebra, high performance
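
The mixed-precision iterative refinement mentioned in the abstract can be illustrated with a minimal CPU-only sketch. It is not the authors' CUBLAS implementation: single precision is emulated in pure Python by rounding every intermediate result through the `struct` module, and all function names (`f32`, `lu_factor_f32`, `lu_solve_f32`, `refine_solve`) and the stopping rule are assumptions made for illustration. The idea shown is the general one: factorize and solve in the fast low precision, then compute residuals and accumulate corrections in double precision until the solution reaches double-precision accuracy.

```python
import struct


def f32(x):
    """Round a Python float (a double) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def lu_factor_f32(a):
    """LU factorization with partial pivoting; every result is rounded to single."""
    n = len(a)
    lu = [[f32(v) for v in row] for row in a]
    perm = list(range(n))  # perm[k] = original index of the row now at position k
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        lu[k], lu[p] = lu[p], lu[k]
        perm[k], perm[p] = perm[p], perm[k]
        for i in range(k + 1, n):
            lu[i][k] = f32(lu[i][k] / lu[k][k])
            for j in range(k + 1, n):
                lu[i][j] = f32(lu[i][j] - f32(lu[i][k] * lu[k][j]))
    return lu, perm


def lu_solve_f32(lu, perm, b):
    """Solve with the single-precision factors, again rounding each operation."""
    n = len(lu)
    y = [f32(b[perm[i]]) for i in range(n)]
    for i in range(n):  # forward substitution (unit lower triangular factor)
        for j in range(i):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
    for i in reversed(range(n)):  # back substitution (upper triangular factor)
        for j in range(i + 1, n):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
        y[i] = f32(y[i] / lu[i][i])
    return y


def residual(a, x, b):
    """Residual r = b - A x, computed in double precision."""
    n = len(x)
    return [b[i] - sum(a[i][j] * x[j] for j in range(n)) for i in range(n)]


def refine_solve(a, b, max_iters=10, tol=1e-14):
    """Single-precision solve followed by double-precision iterative refinement."""
    lu, perm = lu_factor_f32(a)           # expensive step, done once in single
    x = lu_solve_f32(lu, perm, b)         # initial low-precision solution
    b_scale = max(abs(v) for v in b)
    for _ in range(max_iters):
        r = residual(a, x, b)             # residual in double precision
        if max(abs(v) for v in r) <= tol * b_scale:
            break
        d = lu_solve_f32(lu, perm, r)     # correction reuses the single factors
        x = [xi + di for xi, di in zip(x, d)]  # update in double precision
    return x
```

In this sketch the factorization, which dominates the cost for large dense systems, is performed only once in the low precision; each refinement step adds only a matrix-vector product and two triangular solves. Convergence should be expected only when the matrix is reasonably well conditioned relative to single-precision accuracy.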