Accelerating Convolutional Neural Networks via Inter-operator Scheduling
Journal
Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
Journal Volume
2023-January
ISBN
9781665473156
Date Issued
2023-01-01
Abstract
Convolutional neural networks (CNNs) are essential to many machine learning tasks. Current deep learning frameworks and compilers usually treat a neural network as a DAG (directed acyclic graph) of tensor operators and execute the operators one at a time in a topological order that respects the dependencies in the DAG. There are two issues with this general approach. First, modern CNNs contain branch structures and therefore form complex DAGs, which makes it hard to find a topological order that schedules operators well within a GPU. Second, modern hardware offers high computational power, so executing operators sequentially under-utilizes its resources. These two issues open the possibility of exploiting inter-operator parallelism, i.e., parallelism among independent operators in the DAG, to use hardware resources more efficiently. In this work, we formally define the DAG scheduling problem, which addresses resource contention, and propose an early-start-time-first algorithm with two heuristic rules for exploiting parallelism between independent operators. Experimental results show that our method improves performance by up to 3.76× on an RTX 3090 compared to sequential execution.
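The scheduling idea described in the abstract can be illustrated with a small sketch: a greedy scheduler that repeatedly picks the ready operator with the earliest possible start time and places it on whichever parallel stream lets it begin soonest. This is only a simplified model under stated assumptions; the function and variable names are hypothetical, and the paper's two heuristic rules and its resource-contention model are not reproduced here.

```python
import heapq

def est_first_schedule(ops, deps, durations, num_streams=2):
    """Greedy earliest-start-time-first scheduling of a DAG of operators
    onto a fixed number of parallel streams (simplified illustration)."""
    succ = {op: [] for op in ops}   # op -> operators that depend on it
    pred = {op: [] for op in ops}   # op -> operators it depends on
    for u, v in deps:               # edge (u, v): v must wait for u
        succ[u].append(v)
        pred[v].append(u)

    finish = {}                        # op -> finish time
    stream_free = [0.0] * num_streams  # next free time of each stream
    # Ready heap ordered by earliest possible start time.
    ready = [(0.0, op) for op in ops if not pred[op]]
    heapq.heapify(ready)
    schedule = []

    while ready:
        est, op = heapq.heappop(ready)
        # Place the operator on the stream where it can start earliest.
        s = min(range(num_streams), key=lambda i: max(stream_free[i], est))
        start = max(stream_free[s], est)
        stream_free[s] = start + durations[op]
        finish[op] = stream_free[s]
        schedule.append((op, s, start))
        for v in succ[op]:
            # v becomes ready once all of its predecessors have finished.
            if all(u in finish for u in pred[v]):
                heapq.heappush(ready, (max(finish[u] for u in pred[v]), v))

    return schedule, max(finish.values())

# Diamond-shaped DAG: A feeds two independent branches B and C, which join at D.
ops = ["A", "B", "C", "D"]
deps = [("A", "B"), ("A", "C"), ("B", "D"), ("C", "D")]
durations = {"A": 1.0, "B": 2.0, "C": 2.0, "D": 1.0}
schedule, makespan = est_first_schedule(ops, deps, durations, num_streams=2)
print(schedule, makespan)
```

With two streams, the independent branches B and C run concurrently, so the makespan is 4.0 instead of the 6.0 a purely sequential (topological-order) execution would take, which is the kind of inter-operator speedup the paper targets.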
Subjects
Convolution-neural-networks | Inter-operator-parallelism | Machine-learning
Type
conference paper
