Virtual thread: maximizing thread-level parallelism beyond GPU scheduling limit

Myung Kuk Yoon,Keunsoo Kim,Sangpil Lee,Won Woo Ro,Murali Annavaram

Virtual thread: maximizing thread-level parallelism beyond GPU scheduling limit

2016

Myung Kuk Yoon
Keunsoo Kim
Sangpil Lee
Won Woo Ro
Murali Annavaram

Modern GPUs require tens of thousands of concurrent threads to fully utilize the massive amount of processing resources. However, thread concurrency in GPUs can be diminished either due to shortage of thread scheduling structures (scheduling limit), such as available program counters and single instruction multiple thread stacks, or due to shortage of on-chip memory (capacity limit), such as register file and shared memory. Our evaluations show that in practice concurrency in many general purpose applications running on GPUs is curtailed by the scheduling limit rather than the capacity limit. Maximizing the utilization of on-chip memory resources without unduly increasing the scheduling complexity is a key goal of this paper.

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations