Wu Q.CHUNG-WEI LIN2025-05-192025-05-192024-01-0110495258https://www.scopus.com/record/display.uri?eid=2-s2.0-105000491713&origin=resultslisthttps://scholars.lib.ntu.edu.tw/handle/123456789/729472falseVariational Delayed Policy Optimizationconference paper2-s2.0-105000491713