https://scholars.lib.ntu.edu.tw/handle/123456789/502351
標題: | Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform | 作者: | Yu, B.-S. Tsao, Y. Yang, S.-W. Chen, Y.-K. SHAO-YI CHIEN |
公開日期: | 2018 | 卷: | 2018-October | 起(迄)頁: | 88-93 | 來源出版物: | IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation | 摘要: | Convolutional neural networks (CNNs) have emerged to provide powerful discriminative capability, especially in the field of image recognition and object detection. However, their massive computation requirements, storage and memory accesses make them hard to be deployed on mobile or embedded systems. In this paper, a few optimizations for a CNN cascade face detection algorithm are proposed to increase throughput while minimizing computation, storage and bandwidth requirement under power constraints. First, the first net of the CNN cascade is converted to a fully convolutional network to reduce 83% of the computation. Second, network retraining is applied to quantize the model parameters from 32-bit floating-point to 2-bit fixed-point, resulting in 93.75% less parameter memory size. Finally, a CNN accelerator is implemented on a Xilinx ZYNQ FPGA board. © 2018 IEEE. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/502351 | ISSN: | 15206130 | DOI: | 10.1109/SiPS.2018.8598428 | SDG/關鍵字: | Convolution; Digital arithmetic; Embedded systems; Field programmable gate arrays (FPGA); Integrated circuit design; Neural networks; Object detection; Silicon compounds; Architecture designs; Bandwidth requirement; Convolutional networks; Convolutional neural network; Face detection algorithm; Model parameters; Power constraints; quantization; Face recognition |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。