z-logo
open-access-imgOpen Access
Measuring Memory Hierarchy Performance of Cache-Coherent Multiprocessors Using Micro Benchmarks
Author(s) -
Cristina Hristea Seibert,
Daniel Lenoski,
John S. Keen
Publication year - 1997
Publication title -
acm/ieee sc 1997 conference (sc'97)
Language(s) - English
DOI - 10.1109/sc.1997.10042
Even with today's large caches, the increasing performance gap between processors and memory systems imposes a memory bottleneck for many important scientific and com mercial applications. This bottleneck is intensified in shared-memory multiprocessors by contention and the ef fects of cache coherency. Under heavy memory contention, the memory latency may increase two or three times. Nonethless, as more sophisticated techniques are used to hide latency and increase bandwidth, measuring memory performance has become increasingly difficult. Previous simple methods to measure memory performance can overestimate unipro cessor memory latency and underestimate bandwidth by tens of percent. We introduce a micro benchmark suite that measures memory hierarchy performance in light of both uniprocessor optimizations and the contention and coherence effects of multiprocessors. The benchmark suite has been used to improve the memory system performance of the SGI Origin multiprocessor.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom