Highly scalable near memory processing with migrating threads on the emu system architecture

Timothy J. Dysart,Peter M. Kogge,Martin M. Deneroff,Eric Bovell,Preston Briggs,Jay B. Brockman,Kenneth Jacobsen,Yujen Juan,Shannon K. Kuntz,Richard Lethin,Janice O. McMahon,Chandra Pawar,Martin Perrigo,Sarah Rucker,John Ruttenberg,Max Ruttenberg,Steve Stein

Highly scalable near memory processing with migrating threads on the emu system architecture

2016

There is growing evidence that current architectures do not well handle cache-unfriendly applications such as sparse math operations, data analytics, and graph algorithms. This is due, in part, to the irregular memory access patterns demonstrated by these applications, and in how remote memory accesses are handled. This paper introduces a new, highly-scalable PGAS memory-centric system architecture where migrating threads travel to the data they access. Scaling both memory capacities and the number of cores can be largely invisible to the programmer.The first implementation of this architecture, implemented with FPGAs, is discussed in detail. A comparison of key parameters with a variety of today's systems, of differing architectures, indicates the potential advantages. Early projections of performance against several well-documented kernels translate these advantages into comparative numbers. Future implementations of this architecture may expand the performance advantages by the application of current state of the art silicon technology.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations