GPU Fast Convolution via the Overlap-and-Save Method in Shared Memory

Karel Adámek,Sofia Dimoudi,Michael B. Giles,Wesley Armour

GPU Fast Convolution via the Overlap-and-Save Method in Shared Memory

2019

Karel Adámek
Sofia Dimoudi
Michael B. Giles
Wesley Armour

We present an implementation of the overlap-and-save method, a method for the convolution of very long signals with short response functions, which is tailored to GPUs. We have implemented several FFT algorithms (using the CUDA programming language) which exploit GPU shared memory, allowing for GPU accelerated convolution. We compare our implementation with an implementation of the overlap-and-save algorithm utilizing the NVIDIA FFT library (cuFFT). We demonstrate that by using a shared memory based FFT we can achieved significant speed-ups for certain problem sizes and lower the memory requirements of the overlap-and-save method on GPUs.

Keywords:

Exploit
CUDA
Computer science
Shared memory
Parallel computing
Convolution
Fast Fourier transform
Theoretical computer science
cuda programming

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations