https://scholars.lib.ntu.edu.tw/handle/123456789/489208
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Valery, O. | en_US |
dc.contributor.author | Liu, P. | en_US |
dc.contributor.author | Wu, J.-J. | en_US |
dc.contributor.author | PANGFENG LIU | zz |
dc.creator | Valery, O.;Liu, P.;Wu, J.-J. | - |
dc.date.accessioned | 2020-05-04T08:02:20Z | - |
dc.date.available | 2020-05-04T08:02:20Z | - |
dc.date.issued | 2019 | - |
dc.identifier.uri | https://scholars.lib.ntu.edu.tw/handle/123456789/489208 | - |
dc.description.abstract | As mobile devices become more prevalent, users tend to reassess their expectations regarding the personalization of mobile services. The data collected by a mobile device's sensors provide an opportunity to gain insight into the user's profile. Recently, deep learning has gained momentum and has become the method of choice for solving machine learning problems. Interestingly, training a deep neural network on a mobile device is often mistakenly regarded as cumbersome. For instance, several deep learning frameworks only provide a CPU-based implementation for prediction tasks on a mobile device. In contrast to servers, a mobile computing environment imposes many domain-specific constraints that invite us to review the general computing approach used in a deep learning framework implementation. In this paper, we propose a deep learning framework that has been specifically designed for mobile device platforms. Our approach relies on the collaboration of the multicore CPU and the integrated GPU to accelerate deep learning computation on mobile devices. Our work exploits the shared memory architecture of mobile devices to promote CPU-GPU collaboration without any data copying. We analyze our approach with regard to three factors: performance/portability trade-off, power efficiency, and memory management. © 2019 John Wiley & Sons, Ltd. | - |
dc.relation.ispartof | Concurrency Computation | - |
dc.subject | deep learning; energy efficient; GPGPU; heterogeneous system; mobile computing; OpenCL | - |
dc.subject.classification | [SDGs]SDG7 | - |
dc.subject.other | Deep neural networks; Economic and social effects; Energy efficiency; Graphics processing unit; Learning systems; Memory architecture; Mobile computing; Program processors; Energy efficient; GPGPU; Heterogeneous systems; Learning frameworks; Machine learning problem; Mobile computing environment; OpenCL; Shared memory architecture; Deep learning | - |
dc.title | A collaborative CPU-GPU approach for deep learning on mobile devices | en_US |
dc.type | journal article | en |
dc.identifier.doi | 10.1002/cpe.5225 | - |
dc.identifier.url | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85062350960&doi=10.1002%2fcpe.5225&partnerID=40&md5=105685ad30e09e87693137b1f8e89c81 | - |
dc.relation.journalvolume | 31 | - |
dc.relation.journalissue | 17 | - |
item.cerifentitytype | Publications | - |
item.fulltext | no fulltext | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.openairetype | journal article | - |
item.grantfulltext | none | - |
crisitem.author.dept | Networking and Multimedia | - |
crisitem.author.dept | Computer Science and Information Engineering | - |
crisitem.author.orcid | 0000-0002-5466-9960 | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。