https://scholars.lib.ntu.edu.tw/handle/123456789/625032
標題: | Optimal Bayesian strategies for the infinite-armed Bernoulli bandit | 作者: | Hung Y.-C. YING-CHAO HUNG |
關鍵字: | Bandit problem; Bayesian strategy; Bernoulli arms; Prior distribution | 公開日期: | 2012 | 卷: | 142 | 期: | 1 | 起(迄)頁: | 86-94 | 來源出版物: | Journal of Statistical Planning and Inference | 摘要: | We consider the bandit problem with an infinite number of Bernoulli arms, of which the unknown parameters are assumed to be i.i.d. random variables with a common distribution F. Our goal is to construct optimal strategies of choosing "arms" so that the expected long-run failure rate is minimized. We first review a class of strategies and establish their asymptotic properties when F is known. Based on the results, we propose a new strategy and prove that it is asymptotically optimal when F is unknown. Finally, we show that the proposed strategy performs well for a number of simulation scenarios. © 2011 Elsevier B.V. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-80052283185&doi=10.1016%2fj.jspi.2011.06.026&partnerID=40&md5=89933d450e514294dcb204de0b734482 https://scholars.lib.ntu.edu.tw/handle/123456789/625032 |
ISSN: | 03783758 | DOI: | 10.1016/j.jspi.2011.06.026 |
顯示於: | 工業工程學研究所 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。