https://scholars.lib.ntu.edu.tw/handle/123456789/394603
標題: | Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations | 作者: | Ching-Feng Yeh Yuan-ming Liou HUNG-YI LEE LIN-SHAN LEE |
關鍵字: | Adaptation; Personalization; Speech recognition; Word vector | 公開日期: | 九月-2015 | 起(迄)頁: | 3521-3525 | 來源出版物: | Interspeech | 摘要: | The popularity of mobile devices offers an ideal platform for personalized recognizers. With data collected from the user, the personalized recognizer with better matched acoustic and linguistic characteristics can offer not only better recognition accuracy but also less computational time. In this paper, we propose a scenario that a small data set (500 utterances with annotation) can be collected for each user and used to personalize the recognizer. Based on this scenario, we present an overall framework for accuracy improvement and computational time reduction. We train Gaussian Mixture Models (GMMs) based on the word vector representations [1][2] and develop word clusters and keyword extraction approaches for personalization of the lexicon and language model. Prototype recognition systems with CD-DNN-HMM [3][4][5] acoustic models adapted by fDLR [6][7][8][9] were implemented and tested for 10 target users. It was shown that the personalized lexicon may include much more user-specific words not obtained before, and significant performance improvement in terms of tradeoff relationships between recognition accuracy and real time factor was observed. Copyright © 2015 ISCA. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84959083231&partnerID=40&md5=6ec5753fbd556bbb2e67f2159fd05ca2 | ISSN: | 2308457X |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。