https://scholars.lib.ntu.edu.tw/handle/123456789/499914
Title: | Speaker verification using kernel-based binary classifiers with binary operation derived features | Authors: | Lee, H.-S.; Tso, Y.; Chang, Y.-F.; Wang, H.-M.; SHYH-KANG JENG |
Keywords: | DNN; i-vector; speaker verification; SVM | Publication date: | 2014 | Pages: | 1660-1664 | Source publication: | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | Abstract: | In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of utterances, each represented by an i-vector, is assumed to belong to either the 'within-speaker' group or the 'between-speaker' group. To solve the problem, we employ various binary operations to retain the basic relationship between any pair of i-vectors and form a single vector for training the discriminative models. This study also investigates how the achievable performance correlates with the number of training pairs and with the various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE). In terms of normalized decision cost function (minDCF) and equal error rate (EER), the results are competitive with or even better than those of other non-probabilistic models, such as the conventional speaker SVMs and the LDA-based cosine distance scoring. © 2014 IEEE. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/499914 https://www.scopus.com/inward/record.uri?eid=2-s2.0-84905238194&doi=10.1109%2fICASSP.2014.6853880&partnerID=40&md5=9597cbd9e4751716d9a340af97c87a68 |
ISSN: | 15206149 | DOI: | 10.1109/ICASSP.2014.6853880 | SDG/Keywords: | Classification (of information); Signal processing; Support vector machines; Vectors; Achievable performance; Binary classification problems; Cosine distance scoring; DNN; I vectors; Speaker recognition evaluations; Speaker verification; SVM; Speech recognition |
Appears in: | Department of Electrical Engineering |
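
The pair-to-vector step described in the abstract can be illustrated with a minimal sketch. The abstract does not name the binary operations used, so the three symmetric element-wise operations below (sum, product, absolute difference), the function names `pair_features` and `build_training_pairs`, and the toy two-dimensional vectors are all assumptions made for illustration, not the authors' configuration. The sketch only shows how two i-vectors might be merged into one labeled feature vector; the resulting examples would then be passed to a binary classifier such as an SVM or a DNN.

```python
from itertools import combinations

# Hypothetical set of symmetric binary operations; the abstract does not list
# which operations the authors actually used.
OPERATIONS = {
    "sum": lambda a, b: a + b,
    "product": lambda a, b: a * b,
    "abs_diff": lambda a, b: abs(a - b),
}


def pair_features(x, y, ops=("sum", "product", "abs_diff")):
    """Apply each operation element-wise to x and y and concatenate the results."""
    if len(x) != len(y):
        raise ValueError("both i-vectors must have the same dimension")
    features = []
    for name in ops:
        op = OPERATIONS[name]
        features.extend(op(a, b) for a, b in zip(x, y))
    return features


def build_training_pairs(ivectors, speakers):
    """Return (feature_vector, label) for every unordered pair of utterances.

    Label 1 marks a within-speaker pair, label 0 a between-speaker pair.
    """
    examples = []
    for i, j in combinations(range(len(ivectors)), 2):
        label = 1 if speakers[i] == speakers[j] else 0
        examples.append((pair_features(ivectors[i], ivectors[j]), label))
    return examples


# Toy 2-dimensional vectors with made-up values; real i-vectors typically have
# hundreds of dimensions.
ivectors = [[1.0, 2.0], [3.0, -1.0], [0.5, 0.5]]
speakers = ["A", "A", "B"]
examples = build_training_pairs(ivectors, speakers)
```

Because every operation in this sketch is symmetric, `pair_features(x, y)` and `pair_features(y, x)` give the same vector, so the order of the two utterances in a trial does not affect the classifier input. Whether the original work restricts itself to symmetric operations is not stated in the abstract.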