On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition

Rohit Prabhavalkar,Ouais Alsharif,Antoine Bruguier,Lan McGraw

On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition

2016

Rohit Prabhavalkar
Ouais Alsharif
Antoine Bruguier
Lan McGraw

We study the problem of compressing recurrent neural networks (RNNs). In particular, we focus on the compression of RNN acoustic models, which are motivated by the goal of building compact and accurate speech recognition systems which can be run efficiently on mobile devices. In this work, we present a technique for general recurrent model compression that jointly compresses both recurrent and non-recurrent inter-layer weight matrices. We find that the proposed technique allows us to reduce the size of our Long Short-Term Memory (LSTM) acoustic model to a third of its original size with negligible loss in accuracy.

Keywords:

Mobile device
Speech recognition
Machine learning
Time delay neural network
Artificial intelligence
Acoustic model
Computer science
Compression (physics)
Pattern recognition
Recurrent neural network
Matrix (mathematics)
model compression
Singular value decomposition

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations