Pushing the limits of RNN Compression

2019 
Recurrent Neural Networks (RNNs) can be difficult to deploy on resource-constrained devices due to their size. As a result, there is a need for compression techniques that can significantly shrink RNNs without negatively impacting task accuracy. This paper introduces a method for compressing RNNs for resource-constrained environments using the Kronecker product (KP). KPs can compress RNN layers by 16–38× with minimal accuracy loss. We show that KP can beat the task accuracy achieved by other state-of-the-art compression techniques across 4 benchmarks spanning 3 different applications, while simultaneously improving inference run-time.
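As a concrete illustration of the idea (not code from the paper), the sketch below shows how expressing a weight matrix as a Kronecker product of two small factors reduces parameter count, and how the matrix-vector product can be computed without ever materializing the full matrix. The factor shapes A (4x8) and B (16x32) are hypothetical and chosen only so the resulting compression ratio (~30x) falls in the 16–38x range quoted above.

```python
import numpy as np

# Hypothetical factor shapes; the dense weight would be (m1*m2) x (n1*n2).
m1, n1 = 4, 8
m2, n2 = 16, 32

rng = np.random.default_rng(0)
A = rng.standard_normal((m1, n1))
B = rng.standard_normal((m2, n2))

# Dense equivalent, built here only to verify the fast path.
W = np.kron(A, B)                      # shape (64, 256)

x = rng.standard_normal(n1 * n2)       # input activation vector

# Efficient matvec without forming W, using the identity
# (A kron B) vec(X) = vec(B X A^T), with column-major (Fortran-order) vec().
X = x.reshape(n2, n1, order="F")
y_fast = (B @ X @ A.T).flatten(order="F")

assert np.allclose(W @ x, y_fast)

# Compression ratio: dense parameters vs. Kronecker factors.
dense_params = W.size                  # 16384
kp_params = A.size + B.size            # 544
print(f"compression: {dense_params / kp_params:.1f}x")   # ~30x
```

The same reshape trick also reduces the cost of the matrix-vector product, which is consistent with the abstract's claim that KP compression can improve inference run-time rather than just memory footprint.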