XNORAM: An Efficient Computing-in-Memory Architecture for Binary Convolutional Neural Networks with Flexible Dataflow Mapping

2020 
In this paper, an energy-efficient computing-in-memory architecture for binary convolutional neural networks, called XNORAM, is proposed. XNORAM employs 6T feature cells and 10T weight cells to form one XNORAM column, with multiplexed XNOR operations embedded in each column. To exploit the data reuse in convolutional neural networks, XNORAM supports flexible dataflow mapping to minimize external data access. To verify the architecture, we design a 4-KB XNORAM prototype in 65-nm CMOS technology. It achieves a throughput of 18.5 GOPS at a 100-MHz clock rate and a 1.0-V power supply. XNOR-AlexNet running on the design achieves 39.86 TOPS/W and 4.63 GOPS/KB utilization with only 1.3% accuracy loss compared with the original XNOR-Net result on GPUs.
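As context for the column operation the abstract describes, the sketch below illustrates the standard XNOR-popcount formulation of a binary dot product, which is the computation an XNOR-based computing-in-memory column evaluates in XNOR-Net-style networks. This is a minimal illustration, not the paper's circuit or code; the function and variable names are hypothetical.

```python
import numpy as np

def xnor_popcount_dot(activations, weights):
    """Binary dot product via XNOR + popcount (illustrative sketch).

    activations, weights: 1-D arrays of binarized +1/-1 values.
    For +/-1 vectors, dot = (#matches) - (#mismatches) = 2*popcount(XNOR) - N.
    """
    a_bits = activations > 0          # map {+1, -1} -> {1, 0}
    w_bits = weights > 0
    matches = ~(a_bits ^ w_bits)      # XNOR: 1 where the bits agree
    popcount = np.count_nonzero(matches)
    return 2 * popcount - activations.size

# Example: a 3x3 binary convolution window flattened into one column access
rng = np.random.default_rng(0)
a = rng.choice([-1, 1], size=9)
w = rng.choice([-1, 1], size=9)
assert xnor_popcount_dot(a, w) == int(np.dot(a, w))
```

Because the multiply-accumulate reduces to XNOR gates plus a bit count, the operation maps naturally onto memory columns, which is the basis of the energy efficiency reported above.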