Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations

Ching-Feng Yeh;Yuan-ming Liou;Hung-yi Lee;Lin-shan Lee

標題:	Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations
作者:	Ching-Feng Yeh Yuan-ming Liou HUNG-YI LEE LIN-SHAN LEE
關鍵字:	Adaptation; Personalization; Speech recognition; Word vector
公開日期:	九月-2015
起(迄)頁:	3521-3525
來源出版物:	Interspeech
摘要:	The popularity of mobile devices offers an ideal platform for personalized recognizers. With data collected from the user, the personalized recognizer with better matched acoustic and linguistic characteristics can offer not only better recognition accuracy but also less computational time. In this paper, we propose a scenario that a small data set (500 utterances with annotation) can be collected for each user and used to personalize the recognizer. Based on this scenario, we present an overall framework for accuracy improvement and computational time reduction. We train Gaussian Mixture Models (GMMs) based on the word vector representations [1][2] and develop word clusters and keyword extraction approaches for personalization of the lexicon and language model. Prototype recognition systems with CD-DNN-HMM [3][4][5] acoustic models adapted by fDLR [6][7][8][9] were implemented and tested for 10 target users. It was shown that the personalized lexicon may include much more user-specific words not obtained before, and significant performance improvement in terms of tradeoff relationships between recognition accuracy and real time factor was observed. Copyright © 2015 ISCA.
URI:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-84959083231&partnerID=40&md5=6ec5753fbd556bbb2e67f2159fd05ca2
ISSN:	2308457X
顯示於：	資訊工程學系

顯示文件完整紀錄

Page view(s)

checked on 2024/4/20

Google Scholar^TM

檢查

TAIR相關文章

Page view(s)

Google ScholarTM

Google Scholar^TM