From Processing-in-Memory to Processing-in-Storage
Author(s) -
Roman Kaplan,
Leonid Yavits,
Ran Ginosar
Publication year - 2017
Publication title -
supercomputing frontiers and innovations
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 16
eISSN - 2409-6008
pISSN - 2313-8734
DOI - 10.14529/jsfi170307
Subject(s) - computer science , parallel computing , interleaved memory , bottleneck , data deduplication , computer data storage , memory map , memory bandwidth , memory hierarchy , computer memory , throughput , massively parallel , registered memory , stream processing , uniform memory access , data processing , speedup , parallel processing , computer hardware , memory management , semiconductor memory , embedded system , cache , shared memory , operating system , wireless
Near-data in-memory processing research has been gaining momentum in recent years. Typical processing-in-memory architecture places a single or several processing elements next to a volatile memory, enabling processing without transferring data to the host CPU. The increased bandwidth to and from volatile memory leads to performance gain. However processing-in-memory does not alleviate von Neumann bottleneck for big data problems, where datasets are too large to fit in main memory. We present a novel processing-in-storage system based on Resistive Content Addressable Memory ReCAM. It functions simultaneously as a mass storage and as a massively parallel associative processor. ReCAM processing-in-storage resolves the bandwidth wall by keeping computation inside the storage arrays, without transferring it up the memory hierarchy. We show that ReCAM based processing-in-storage architecture may outperform existing processing-in-memory and accelerator based designs. ReCAM processing-in-storage implementation of Smith-Waterman DNA sequence alignment reaches a speedup of almost five over a GPU cluster. An implementation of in-storage inline data deduplication is presented and shown to achieve orders of magnitude higher throughput than traditional CPU and DRAM based systems.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom