https://scholars.lib.ntu.edu.tw/handle/123456789/580595
標題: | A Hybrid DQN and Optimization Approach for Strategy and Resource Allocation in MEC Networks | 作者: | Wu Y Dinh T.Q Fu Y Lin C Quek T.Q.S. CHE LIN |
關鍵字: | Convex optimization; Fading channels; Game theory; Reinforcement learning; Resource allocation; Communication resources; Convergence behaviors; Nonconvex problem; Optimization algorithms; Optimization approach; Resource allocation problem; Resource allocation strategies; Time varying fading channels; Network architecture | 公開日期: | 2021 | 來源出版物: | IEEE Transactions on Wireless Communications | 摘要: | We consider a multi-user multi-server mobile edge computing (MEC) network with time-varying fading channels and formulate an offloading decision and resource allocation problem. To solve this mixed-integer non-convex problem, we propose two hybrid approaches that learn offloading strategy with DQN (opt-DQN) or Q-table (opt-QL) at each user equipment (UE). The communication resources are allocated with an optimization algorithm at each computational access point (CAP). We also propose a pure DQN method that learns both the offloading strategy and resource allocation via Q-learning (QL). We analyze the convergence behavior of the QL-based algorithms from a game-theoretical perspective and demonstrate the performance of the proposed hybrid approaches for different network sizes. The simulation results show that the hybrid approaches reach lower costs than other baseline algorithms and the pure-DQN approach. Moreover, the performance of the pure-DQN approach degrades severely as the network size increases, while opt-DQN still performs the best, followed by opt-QL. These observations demonstrate that the hybrid approach that combines the advantages of both QL and convex optimization is a promising design for a multi-user MEC network, wherein complicated offloading and resource allocation strategies need to be determined in a timely and accurate fashion. IEEE |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85100930827&doi=10.1109%2fTWC.2021.3057882&partnerID=40&md5=c2faca7b1b19d3e566712d6a5b02efcb https://scholars.lib.ntu.edu.tw/handle/123456789/580595 |
ISSN: | 15361276 | DOI: | 10.1109/TWC.2021.3057882 | SDG/關鍵字: | Convex optimization; Fading channels; Game theory; Reinforcement learning; Resource allocation; Communication resources; Convergence behaviors; Nonconvex problem; Optimization algorithms; Optimization approach; Resource allocation problem; Resource allocation strategies; Time varying fading channels; Network architecture |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。