Highly Scalable Near Memory Processing with Migrating Threads on the Emu System Architecture.
Author(s) -
Timothy J. Dysart,
Peter M. Kogge,
Martin M. Deneroff,
Eric Bovell,
Preston Briggs,
Jay B. Brockman,
Kenneth Jacobsen,
Yujen Juan,
Shan K. Kuntz,
Richard A. Lethin,
Janice O. McMahon,
Chandra Pawar,
Martin Perrigo,
Sarah Rucker,
John Ruttenberg,
Max Ruttenberg,
Steve Stein
Publication year - 2016
Publication title -
2016 6th workshop on irregular applications: architecture and algorithms (ia3)
Language(s) - English
DOI - 10.1109/ia3.2016.7
There is growing evidence that current architectures do not well handle cache-unfriendly applications such as sparse math operations, data analytics, and graph algorithms. This is due, in part, to the irregular memory access patterns demonstrated by these applications, and in how remote memory accesses are handled. This paper introduces a new, highly-scalable PGAS memory-centric system architecture where migrating threads travel to the data they access. Scaling both memory capacities and the number of cores can be largely invisible to the programmer. The first implementation of this architecture, implemented with FPGAs, is discussed in detail. A comparison of key parameters with a variety of today's systems, of differing architectures, indicates the potential advantages. Early projections of performance against several well-documented kernels translate these advantages into comparative numbers. Future implementations of this architecture may expand the performance advantages by the application of current state of the art silicon technology.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom