https://scholars.lib.ntu.edu.tw/handle/123456789/413002
Title: | Automatic facial image annotation and retrieval by integrating voice label and visual appearance | Authors: | Jheng H.-W.; Chen B.-C.; Chen Y.-Y.; WINSTON HSU |
Keywords: | Face annotation; Image retrieval; Spoken annotation | Issue date: | 2014 | Pages: | 1001-1004 | Source publication: | 2014 ACM Conference on Multimedia | Abstract: | Annotation is important for managing and retrieving large numbers of photos, but it is generally labor-intensive and time-consuming. Speaking while taking photos, however, is straightforward and effortless, and annotating by voice is faster than typing. To minimize the manual cost of annotating photos, we propose a novel framework that uses the scarce spoken annotations recorded during capture as voice labels and automatically labels every facial image in the photo collection. To accomplish this goal, we employ a probabilistic graphical model that integrates voice labels and visual appearance for inference. Combined with group prior estimation and gender attribute association, the framework achieves outstanding performance on the proposed synthesized group photo collections. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/413002 | ISBN: | 9781450330633 | DOI: | 10.1145/2647868.2655015 |
Appears in collections: | Department of Computer Science and Information Engineering |
Items in the IR system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.