Learning to quantize deep neural networks: a competitive-collaborative approach

Md. Fahim Faysal Khan, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Vijaykrishnan Narayanan

2020

Neural network quantization methods have recently attracted considerable attention because they reduce model size and computation cost for dedicated AI accelerator designs. Unfortunately, merely minimizing quantization loss under a fixed discretization degrades accuracy. In this paper, we propose competitive-collaborative quantization (CCQ), an iterative accuracy-driven learning framework that gradually adapts the bit-precision of each individual layer. Unlike prior quantization policies, which keep the first and last layers of the network at full precision, CCQ is orthogonal to the choice of policy: it adds layer-wise competition on top of any target quantization policy, followed by holistic fine-tuning of all layers to recover accuracy, so that state-of-the-art networks can be quantized entirely without significant accuracy degradation.
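To make the idea of gradually lowering per-layer bit-precision concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a symmetric uniform weight quantizer, a hypothetical bit ladder such as 8, 4, 2, and a greedy rule that lowers whichever layer hurts a score the least. The score here is the negative weight reconstruction error, used only as a stand-in for validation accuracy, and the fine-tuning stage is left as a comment. All function names (`quantize_uniform`, `competitive_step`, `greedy_schedule`, `neg_mse_score`) are illustrative.

```python
import numpy as np

def quantize_uniform(w, bits):
    """Symmetric uniform quantization of an array to `bits` bits (illustrative)."""
    levels = 2 ** (bits - 1) - 1          # e.g. 7 positive levels for 4 bits
    max_abs = np.max(np.abs(w))
    if max_abs == 0 or levels == 0:
        return np.zeros_like(w)
    scale = max_abs / levels
    return np.round(w / scale) * scale

def neg_mse_score(original):
    """Assumed proxy score: higher is better; stands in for validation accuracy."""
    def score(quantized):
        return -sum(np.mean((q - w) ** 2) for q, w in zip(quantized, original))
    return score

def competitive_step(weights, bits, ladder, score_fn):
    """Layers 'compete': try lowering each one by a ladder step, keep the best."""
    best = None
    for i, b in enumerate(bits):
        idx = ladder.index(b)
        if idx + 1 >= len(ladder):
            continue                       # layer is already at the lowest precision
        trial = list(bits)
        trial[i] = ladder[idx + 1]
        quantized = [quantize_uniform(w, tb) for w, tb in zip(weights, trial)]
        s = score_fn(quantized)
        if best is None or s > best[0]:
            best = (s, trial)
    return best

def greedy_schedule(weights, ladder, score_fn, steps):
    """Run `steps` competition rounds, starting from the highest precision."""
    bits = [ladder[0]] * len(weights)
    for _ in range(steps):
        result = competitive_step(weights, bits, ladder, score_fn)
        if result is None:
            break                          # every layer is at the lowest precision
        _, bits = result
        # A fine-tuning pass over all layers would go here (omitted in this sketch).
    return bits

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    layers = [rng.normal(size=(4, 4)), 10 * rng.normal(size=(4, 4))]
    print(greedy_schedule(layers, [8, 4, 2], neg_mse_score(layers), steps=3))
```

In a real setting the score would come from evaluating the quantized network on held-out data, and the commented fine-tuning step would retrain the weights after each round.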