Scheduling-Aware Prefetching: Enabling the PCIe SSD to Extend the Global Memory of GPU Device

Wang T.-Y; Wu C.-F; Chang Y.-H; TEI-WEI KUO; Tsao C.-W; Wang T.-Y;Wu C.-F;Tsao C.-W;Chang Y.-H;Kuo T.-W.

doi:10.1109/NVMSA53655.2021.9628829

Scheduling-Aware Prefetching: Enabling the PCIe SSD to Extend the Global Memory of GPU Device

Journal

Proceedings - 10th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2021

Date Issued

2021

Author(s)

Wang T.-Y

Wu C.-F

Chang Y.-H

TEI-WEI KUO

Tsao C.-W

DOI

10.1109/NVMSA53655.2021.9628829

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85124033327&doi=10.1109%2fNVMSA53655.2021.9628829&partnerID=40&md5=d451103d98cdacc15ec06c72e3ba9c54

https://scholars.lib.ntu.edu.tw/handle/123456789/607472

Abstract

The evolution of Cyber-Physical Systems (CPSs) and Internet of Things (IoTs) enables mobile and smart embedded devices to be equipped with embedded GPUs for accelerating data-intensive applications. To cut down device prices and reduce energy consumption, current GPUs adopt the unified memory architecture to extend memory size with using the PCIe SSD which is cheaper than directly enlarging the off-chip DRAM on the GPU. However, adopting the unified memory architecture, data shall be moved to the host DRAM before being moved to the off-chip DRAM and thus it leads to serious contention issues among CPUs and GPUs on the host DRAM. Although the advent of new communication technology provides the opportunity for GPUs to directly access the PCIe SSD without passing the host DRAM, it leads to high data movement costs because the latency gap between the off-chip DRAM and the PCIe SSD is large. To enhance the performance of the low-cost energy-efficient GPU memory systems, this work advocates a hardware-controller-based memory extension solution to not only avoid the contention issues on the host DRAM but also reduce the data movement costs. Particularly, we propose a scheduling-aware prefetching design to perform data prefetching by utilizing the information from the hardware warp scheduler. The proposed solution was evaluated by a series of intensive experiments and the results are encouraging. ? 2021 IEEE.

Subjects

Cost reduction

Dynamic random access storage

Embedded systems

Energy efficiency

Energy utilization

Graphics processing unit

Green computing

Memory architecture

Scheduling

'current

Communicationtechnology

Data movements

Data-intensive application

Embedded device

Memory size

Off-chip

Performance

Prefetching

Reduce energy consumption

Program processors

SDGs

[SDGs]SDG7

[SDGs]SDG11

Type

conference paper

Scheduling-Aware Prefetching: Enabling the PCIe SSD to Extend the Global Memory of GPU Device

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)