
BooLUT-CIM: A 52.4 TOPS/W Radix-4 Booth-LUT Digital CIM With Negative-Magnitude-Bits Inversion Storage
Author(s) -
Yi Yang,
Xiao Tan,
Jinwu Chen,
Yucheng Du,
Tianhui Jiao,
Xing Wang,
An Guo,
Xin Si
Publication year - 2025
Publication title -
ieee transactions on circuits and systems ii: express briefs
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.799
H-Index - 115
eISSN - 1558-3791
pISSN - 1549-7747
DOI - 10.1109/tcsii.2025.3590745
Subject(s) - components, circuits, devices and systems
Compute-in-Memory (CIM) is an effective approach to boost the efficiency of AI hardware by reducing data movement. Digital CIM (DCIM) has garnered extensive attention owing to its immunity to PVT variations and high precision. Recently, a lookup table (LUT)-based DCIM architecture was proposed, which stores the precomputed summations of adjacent weights in memory, thereby alleviating the substantial overhead of adder trees in traditional DCIM designs. However, the efficiency of LUT-based DCIM is difficult to further improve due to the bit-serial input method and the power-hungry LUT readout operations. To address this challenge, this work presents a BooLUT-CIM design that integrates the advantages of LUT-based CIM and Booth algorithm, reducing the quantity of lookup and addition operations by a half. A LUT compression method derived from computation-friendly numerical relationships is utilized to reduce circuit overhead. Additionally, a negative magnitude-bits inversion storage scheme is proposed to reduce inefficient discharge, saving 38% of LUT array readout power on average. A prototype of this design is implemented in 28nm CMOS technology for validation. Simulation results show that the macro achieves an energy efficiency of 52.4 TOPS/W at 0.9V, 400 MHz, which is 1.3× higher than prior arts.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom