Low Precision Deep Learning Training on Mobile Heterogeneous Platform
Journal
Proceedings - 26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2018
Pages
109-117
Date Issued
2018
Author(s)
Abstract
Recent advances in System-on-Chip architectures have made deep learning suitable for a number of applications on mobile devices. Unfortunately, due to the computational cost of neural network training, deep learning on mobile devices is often limited to inference tasks, e.g., prediction. In this paper, we propose a deep learning framework that enables both training and inference tasks on mobile devices. While accommodating the heterogeneity of computing hardware found on mobile devices, it uses OpenCL to efficiently leverage modern SoC capabilities, e.g., multi-core CPUs, integrated GPUs, and shared memory architecture, and to accelerate deep learning computation. In addition, our system encodes the arithmetic operations of deep networks down to 8-bit fixed-point on mobile devices. As a proof of concept, we trained three well-known neural networks on mobile devices and observed significant performance gains, energy consumption reductions, and memory savings. © 2018 IEEE.
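The abstract mentions encoding network arithmetic down to 8-bit fixed-point but does not detail the scheme. Below is a minimal illustrative sketch, not the paper's implementation, of symmetric 8-bit quantization of a float buffer in C++; the function names and the single per-buffer scale factor are assumptions chosen for illustration.

// Sketch: map the largest magnitude in a float buffer to 127 and
// round every element to int8; dequantize by dividing by the scale.
#include <cstdint>
#include <cmath>
#include <vector>
#include <algorithm>
#include <cstdio>

std::vector<int8_t> quantize_int8(const std::vector<float>& x, float& scale_out) {
    float max_abs = 0.0f;
    for (float v : x) max_abs = std::max(max_abs, std::fabs(v));
    // Guard against an all-zero buffer.
    float scale = (max_abs > 0.0f) ? (127.0f / max_abs) : 1.0f;
    std::vector<int8_t> q(x.size());
    for (size_t i = 0; i < x.size(); ++i) {
        float r = std::round(x[i] * scale);
        r = std::min(127.0f, std::max(-127.0f, r));  // clamp to int8 range
        q[i] = static_cast<int8_t>(r);
    }
    scale_out = scale;
    return q;
}

float dequantize(int8_t q, float scale) { return static_cast<float>(q) / scale; }

int main() {
    std::vector<float> w = {0.12f, -0.53f, 0.97f, -0.08f};
    float scale = 1.0f;
    std::vector<int8_t> qw = quantize_int8(w, scale);
    for (size_t i = 0; i < w.size(); ++i)
        std::printf("%f -> %d -> %f\n", w[i], qw[i], dequantize(qw[i], scale));
    return 0;
}

In practice such quantization would be applied per layer (or per channel) to weights and activations before dispatching the 8-bit kernels to the CPU or integrated GPU via OpenCL; the single-scale scheme here is only the simplest variant.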
SDGs
Other Subjects
Energy utilization; Fixed point arithmetic; Memory architecture; Mobile computing; Network architecture; Neural networks; Program processors; Programmable logic controllers; System-on-chip; GPGPU; Heterogeneous platforms; Heterogeneous systems; Neural network training; OpenCL; Shared memory architecture; System-on-chip architecture; Transfer learning; Deep learning
Type
conference paper