Merging Well-Trained Deep CNN Models for Efficient Inference
Journal
2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings
Pages
1594-1600
Date Issued
2020
Abstract
In signal processing applications, more than one task often has to be integrated into a system, so deep learning models (such as convolutional neural networks) serving multiple purposes must be executed simultaneously. When multiple well-trained models are deployed to an application system, running them side by side is inefficient because of their collective computational load. Hence, merging the models into a more compact one is often required so that they can be executed more efficiently on resource-limited devices. We introduce an approach that fuses two or more well-trained deep neural-network models into a condensed model for the inference stage. The proposed approach consists of three phases: Filter Alignment, Shared-weight Initialization, and Model Calibration. It can merge well-trained feed-forward neural networks of the same architecture into a single network, reducing online storage and inference time. Experimental results show that our approach improves both the run-time memory compression ratio and the computational speed of execution. © 2020 APSIPA.
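The three phases named in the abstract can be illustrated with a minimal NumPy sketch on a single fully-connected layer. This is an assumption-laden toy, not the paper's actual algorithm: the function names, the greedy cosine-similarity matching used for Filter Alignment, and the element-wise averaging used for Shared-weight Initialization are all illustrative choices.

```python
import numpy as np

def align_filters(w_a, w_b):
    """Phase 1 (Filter Alignment, sketched): permute the rows (filters)
    of w_b to best match the rows of w_a by greedy cosine similarity."""
    def unit(w):
        return w / (np.linalg.norm(w, axis=1, keepdims=True) + 1e-12)
    sim = unit(w_a) @ unit(w_b).T            # pairwise filter similarities
    perm = np.full(w_a.shape[0], -1)
    used = set()
    for i in np.argsort(-sim.max(axis=1)):   # most confident rows first
        j = next(k for k in np.argsort(-sim[i]) if k not in used)
        perm[i] = j
        used.add(j)
    return w_b[perm], perm

def shared_weight_init(w_a, w_b_aligned):
    """Phase 2 (Shared-weight Initialization, sketched): initialize the
    merged layer's shared weights as the element-wise mean."""
    return 0.5 * (w_a + w_b_aligned)

rng = np.random.default_rng(0)
w_a = rng.standard_normal((8, 16))
w_b = w_a[::-1] + 0.01 * rng.standard_normal((8, 16))  # permuted near-copy

w_b_aligned, perm = align_filters(w_a, w_b)
w_shared = shared_weight_init(w_a, w_b_aligned)
# Phase 3 (Model Calibration) would fine-tune w_shared on data from both
# tasks to recover accuracy; it is omitted in this sketch.
```

In this toy setup the second model's filters are a noisy, reversed copy of the first model's, so alignment should recover the reversal permutation before the weights are averaged into the shared set.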
Subjects
Convolutional neural networks; Deep learning; Deep neural networks; Merging; Network architecture; Signal processing; Application systems; Computational speed; Memory compression; Model calibration; Neural network model; Resource-limited devices; Signal processing applications; Weight initialization; Feedforward neural networks
Type
conference paper