EIE: Efficient Inference Engine on Compressed Deep Neural Network

Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark Horowitz, William J. Dally

2016

State-of-the-art deep neural networks (DNNs) have hundreds of millions of connections and are both computationally and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources and power budgets. While custom hardware helps the computation, fetching weights from DRAM is two orders of magnitude more expensive than ALU operations, and dominates the required power.
