Implementation and Performance of a GPU-Based Monte-Carlo Framework for Determining Design Ice Load

Sara Ayubian,Shadi Alawneh,Martin Richard,Jan Thij ssen

Implementation and Performance of a GPU-Based Monte-Carlo Framework for Determining Design Ice Load

2017

Sara Ayubian
Shadi Alawneh
Martin Richard
Jan Thij ssen

Modern Graphics Processing Units (GPUs) with massive number of threads and many-core architecture support both graphics and general purpose computing. NVIDIA's compute unified device architecture (CUDA) takes advantage of parallel computing and utilizes the tremendous power of GPUs. The present study demonstrates a high performance computing (HPC) framework for a Monte-Carlo simulation to determine design sea ice loads which is implemented in both GPU and CPU. Results show a speedup of up to 130 times for the 4 Tesla K80 GPUs over an optimized CPU OpenMP implementation and speedup of up to 8 times for the 4 Tesla K80 over a single Tesla K80 GPU implementation. The elapsed time of the different implementations has been reduced from about 2.5 hours to 0.7 seconds.

Keywords:

Parallel computing
CUDA Pinned memory
Computer architecture
CUDA
Speedup
Thread (computing)
Central processing unit
Architecture
General-purpose computing on graphics processing units
Supercomputer
Computer science
Monte Carlo method
Graphics

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations