Experiences with Mapping Non-linear Memory Access Patterns into GPUs

2009 
Modern Graphics Processing Units (GPUs) are very powerful computational systems on a chip. For this reason there is growing interest in using these units as general-purpose hardware accelerators (GPGPU). To facilitate the programming of general-purpose applications, NVIDIA introduced the CUDA programming environment. CUDA provides a simplified abstraction of the underlying complex GPU architecture, so a number of critical optimizations must be applied to the code in order to get maximum performance. In this paper we discuss our experience in porting an application kernel to the GPU, and the classes of design decisions we adopted in order to obtain maximum performance.
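The abstract does not reproduce the authors' kernel, but the kind of optimization it alludes to for non-linear memory access patterns can be illustrated with a standard CUDA technique: staging strided (column-wise) accesses through shared memory so that global-memory traffic stays coalesced. The sketch below is illustrative only; the names (transposeNaive, transposeTiled, TILE) and the transpose workload are assumptions, not the paper's code.

```cuda
// Illustrative sketch: coalescing a non-linear (strided) access pattern via shared memory.
#include <cstdio>
#include <cuda_runtime.h>

#define TILE 32

// Naive transpose: the write to out is strided by `height`, so each warp
// scatters its stores across memory (uncoalesced, non-linear pattern).
__global__ void transposeNaive(const float *in, float *out, int width, int height)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;   // column in input
    int y = blockIdx.y * blockDim.y + threadIdx.y;   // row in input
    if (x < width && y < height)
        out[x * height + y] = in[y * width + x];
}

// Tiled transpose: a tile is read coalesced into shared memory, the reordering
// happens on-chip, and the tile is written back coalesced. The +1 padding
// avoids shared-memory bank conflicts on the transposed read.
__global__ void transposeTiled(const float *in, float *out, int width, int height)
{
    __shared__ float tile[TILE][TILE + 1];

    int x = blockIdx.x * TILE + threadIdx.x;
    int y = blockIdx.y * TILE + threadIdx.y;
    if (x < width && y < height)
        tile[threadIdx.y][threadIdx.x] = in[y * width + x];

    __syncthreads();

    // Swap the block indices so the output write is contiguous per warp.
    x = blockIdx.y * TILE + threadIdx.x;   // column in output (bounded by height)
    y = blockIdx.x * TILE + threadIdx.y;   // row in output (bounded by width)
    if (x < height && y < width)
        out[y * height + x] = tile[threadIdx.x][threadIdx.y];
}

int main()
{
    const int W = 64, H = 64;
    float *d_in, *d_out;
    cudaMalloc(&d_in,  W * H * sizeof(float));
    cudaMalloc(&d_out, W * H * sizeof(float));

    dim3 block(TILE, TILE);
    dim3 grid((W + TILE - 1) / TILE, (H + TILE - 1) / TILE);
    transposeTiled<<<grid, block>>>(d_in, d_out, W, H);
    cudaDeviceSynchronize();
    printf("launch status: %s\n", cudaGetErrorString(cudaGetLastError()));

    cudaFree(d_in);
    cudaFree(d_out);
    return 0;
}
```

On most CUDA-capable GPUs the tiled version is substantially faster than the naive one for the same data, which is the general flavor of architecture-aware optimization the abstract refers to.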