Supporting compressed-sparse activations and weights on SIMD-like accelerator for sparse convolutional neural networks

Chien Yu Lin,Bo-Cheng Lai

Supporting compressed-sparse activations and weights on SIMD-like accelerator for sparse convolutional neural networks

2018

Sparsity is widely observed in convolutional neural networks by zeroing a large portion of both activations and weights without impairing the result. By keeping the data in a compressed-sparse format, the energy consumption could be considerably cut down due to less memory traffic. However, the wide SIMD-like MAC engine adopted in many CNN accelerators can not support the compressed input due to the data misalignment. In this work, a novel Dual Indexing Module (DIM) is proposed to efficiently handle the alignment issue where activations and weights are both kept in compressed-sparse format. The DIM is implemented in a representative SIMD-like CNN accelerator, and able to exploit both compressed-sparse activations and weights. The synthesis results with 40nm technology have shown that DIM can enhance up to 46% of energy consumption and 55.4% Energy-Delay-Product (EDP).

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations