Title: Learning key steps to attack deep reinforcement learning agents
Authors: Chien-Min Yu; Ming-Hsin Chen; Hsuan-Tien Lin
Keywords: Adversarial attacks; Deep learning; Reinforcement learning; Robustness
Issue Date: 1-Jan-2023
Journal: Machine Learning
Abstract: Deep reinforcement learning agents are vulnerable to adversarial attacks. In particular, recent studies have shown that attacking a few key steps can effectively decrease the agent's cumulative reward. However, all existing attacking methods define those key steps with human-designed heuristics, and it is not clear how more effective key steps can be identified. This paper introduces a novel reinforcement learning framework that learns key steps through interacting with the agent. The proposed framework requires no human heuristics or domain knowledge, and can be flexibly coupled with any white-box or black-box adversarial attack scenario. Experiments on benchmark Atari games across different scenarios demonstrate that the proposed framework is superior to existing methods at identifying effective key steps. The results highlight the weakness of RL agents even under budgeted attacks.
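The idea of a budgeted key-step attack can be illustrated with a toy sketch. This is not the paper's learned framework: the environment is a synthetic one-step-reward MDP, the "key steps" are selected by an oracle heuristic rather than a learned attacker, and all names are hypothetical.

```python
# Toy MDP: at each step t the agent observes a bit and earns weights[t]
# if its action matches the true bit. An attacker with budget B may flip
# the observation at up to B steps of its choosing.
weights = [1, 1, 5, 1, 5, 1, 1, 5, 1, 1]

def run_episode(attack_steps=frozenset()):
    """Return the cumulative reward when the given steps are attacked."""
    total = 0
    for t, w in enumerate(weights):
        true_bit = t % 2                                     # ground-truth signal
        obs = 1 - true_bit if t in attack_steps else true_bit  # attacker flips obs
        action = obs                                         # agent trusts its observation
        if action == true_bit:
            total += w
    return total

budget = 2
# Oracle "key step" selection for illustration: attack the highest-weight steps.
key_steps = sorted(range(len(weights)), key=lambda t: weights[t], reverse=True)[:budget]

clean = run_episode()                        # no attack: 22
random_attack = run_episode(frozenset([0, 1]))  # naive choice of steps: 20
key_attack = run_episode(frozenset(key_steps))  # key steps {2, 4}: 12
```

With the same budget of two perturbed steps, attacking the key steps costs the agent 10 reward while a naive choice costs only 2, which is the gap that motivates learning where to attack instead of relying on hand-designed heuristics.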
URI: https://scholars.lib.ntu.edu.tw/handle/123456789/629898
ISSN: 0885-6125
DOI: 10.1007/s10994-023-06318-9
Appears in Collections: Department of Computer Science and Information Engineering
Items in the institutional repository are protected by copyright, with all rights reserved, unless otherwise indicated in their respective license terms.