Hung W.; SHAO-HUA SUN; Hsieh P.-C. | 2025-10-23 | 2025-10-23 | 2025-01-01 | [9798331320850] | https://scholars.lib.ntu.edu.tw/handle/123456789/732835 | false | EFFICIENT ACTION-CONSTRAINED REINFORCEMENT LEARNING VIA ACCEPTANCE-REJECTION METHOD AND AUGMENTED MDPS | conference paper | 2-s2.0-105010205028