https://scholars.lib.ntu.edu.tw/handle/123456789/322191
Title: | Semantic context detection using audio event fusion: Camera-ready version | Authors: | Chu, W.-T.; Wen-Huang Cheng; Ja-Ling Wu |
Issue Date: | 2006 | Volume: | 2006 | Pages: | 1-12 | Source Publication: | Eurasip Journal on Applied Signal Processing | Abstract: | Semantic-level content analysis is a crucial issue in achieving efficient content retrieval and management. We propose a hierarchical approach that models audio events over a time series in order to accomplish semantic context detection. Two levels of modeling, audio event and semantic context modeling, are devised to bridge the gap between physical audio features and semantic concepts. In this work, hidden Markov models (HMMs) are used to model four representative audio events, that is, gunshot, explosion, engine, and car braking, in action movies. At the semantic context level, generative (ergodic hidden Markov model) and discriminative (support vector machine (SVM)) approaches are investigated to fuse the characteristics and correlations among audio events, which provide cues for detecting gunplay and car-chasing scenes. The experimental results demonstrate the effectiveness of the proposed approaches and provide a preliminary framework for information mining by using audio characteristics. Copyright © 2006 Hindawi Publishing Corporation. All rights reserved. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-33645149573&doi=10.1155%2fASP%2f2006%2f27390&partnerID=40&md5=fc8b72e13e8ecbc3e9da28b2f8e4e027 http://scholars.lib.ntu.edu.tw/handle/123456789/322191 |
DOI: | 10.1155/ASP/2006/27390 | SDG/Keywords: | Audio events; Audio features; Hidden Markov models (HMM); Support vector machine (SVM); Correlation methods; Database systems; Hierarchical systems; Information retrieval; Markov processes; Semantics |
Appears in Collections: | Department of Computer Science and Information Engineering |