On latency in GPU throughput microarchitectures

Michael Andersch, Jan Lucas, Mauricio Álvarez-Mesa, Ben H. H. Juurlink

2015

Modern GPUs provide massive processing power (arithmetic throughput) as well as memory throughput. While it appears to be well understood how performance can be improved by increasing throughput, it is less clear how microarchitectural latencies affect the performance of throughput-oriented GPU architectures. In fact, little is publicly known about the values, behavior, and performance impact of microarchitectural latency components in modern GPUs. This work attempts to fill that gap by analyzing both the idle (static) and the loaded (dynamic) latency behavior of GPU microarchitectural components. Our results show that GPUs are not as effective at latency hiding as commonly thought; based on this, we argue that latency, in addition to throughput, should be a GPU design consideration.

Keywords:

Parallel computing
Real-time computing
Computer science
Throughput
Idle
Microarchitecture
Latency (engineering)
Random-access memory
