A Note on the GPU Acceleration of Eigenvalue Computations

Karl Rupp, Ph. Tillet, B.F. Smith, Tibor Grasser, Ansgar Jüngel

2013

Eigenvalue computations for large sparse matrices are commonly based on Krylov subspace techniques such as the Lanczos method. One of the dominant operations in such algorithms is the iterated computation of inner products with the same vector in order to preserve orthogonality of the Krylov basis. These operations can be accelerated on GPUs using existing BLAS functionality. However, this approach is not fully efficient because of unnecessary memory transfers. We present improved implementations in CUDA and OpenCL, which are now available in ViennaCL, PETSc, and SLEPc, and demonstrate a performance gain of up to a factor of two over existing GPU vendor libraries.
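The idea of computing several inner products with the same vector in a single pass can be illustrated with a minimal CPU sketch in plain Python. This is not the CUDA or OpenCL code from ViennaCL, PETSc, or SLEPc; the function names `separate_dots`, `fused_dots`, and `orthogonalize` are hypothetical and chosen only for illustration. The baseline mimics one BLAS-style dot call per basis vector, so `x` is traversed once per basis vector, whereas the fused variant reads each entry of `x` once and reuses it for all inner products.

```python
from typing import List, Sequence


def separate_dots(x: Sequence[float], basis: Sequence[Sequence[float]]) -> List[float]:
    """Baseline: one dot product per basis vector, so x is traversed len(basis) times."""
    return [sum(xi * vi for xi, vi in zip(x, v)) for v in basis]


def fused_dots(x: Sequence[float], basis: Sequence[Sequence[float]]) -> List[float]:
    """Fused variant: traverse x once and update every inner product per entry."""
    results = [0.0] * len(basis)
    for i, xi in enumerate(x):         # each x[i] is read a single time
        for j, v in enumerate(basis):  # and reused for every basis vector
            results[j] += xi * v[i]
    return results


def orthogonalize(x: Sequence[float], basis: Sequence[Sequence[float]]) -> List[float]:
    """One classical Gram-Schmidt step; assumes the basis vectors are orthonormal."""
    coeffs = fused_dots(x, basis)
    return [xi - sum(c * v[i] for c, v in zip(coeffs, basis))
            for i, xi in enumerate(x)]
```

Both functions return the same values; they differ only in how often `x` is read. On a GPU, the analogous fused kernel would load `x` from global memory once instead of once per inner product, which is one plausible reading of the "unnecessary memory transfers" mentioned above.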

Keywords:

CUDA
Lanczos method
Eigenvalues and eigenvectors
Iterated function
Sparse matrix
Parallel computing
Linear algebra
Orthogonality
Krylov subspace
Computer science
Theoretical computer science
Computational science
Computation
