Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform

2018 
Convolutional neural networks (CNNs) have emerged to provide powerful discriminative capability, especially in the field of image recognition and object detection. However, their massive computation requirements, storage and memory accesses make them hard to be deployed on mobile or embedded systems. In this paper, a few optimizations for a CNN cascade face detection algorithm are proposed to increase throughput while minimizing computation, storage and bandwidth requirement under power constraints. First, the first net of the CNN cascade is converted to a fully convolutional network to reduce 83% of the computation. Second, network retraining is applied to quantize the model parameters from 32-bit floating-point to 2-bit fixed-point, resulting in 93.75% less parameter memory size. Finally, a CNN accelerator is implemented on a Xilinx ZYNQ FPGA board.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    18
    References
    1
    Citations
    NaN
    KQI
    []