Pronunciation Modeling with Reduced Confusion for Mandarin Chinese Using A Three-stage Framework
Resource
IEEE Transactions on Audio, Speech, and Language Processing 15 (2): 661-675
Journal
IEEE Transactions on Audio, Speech and Language Processing
Journal Volume
15
Journal Issue
2
Pages
661 - 675
Date Issued
2007
Date
2007
Author(s)
Abstract
Multiple-pronunciation dictionaries have been found to be useful in pronunciation modeling for speech recognition. However, the extra pronunciation variants added in the dictionary inevitably increase the confusion among different words during recognition, and consequently limit the achievable improvements in the recognition performance. This paper proposes a three-stage framework for Mandarin Chinese to construct automatically the multiple-pronunciation dictionary while reducing the possible confusion caused. The proposed framework includes pronunciation generation (Stage 1), ranking (Stage 2) and pruning (Stage3). New measures of confusability for multiple-pronunciation dictionaries were developed and shown to have a very strong correlation with recognition performance. With the proposed framework, it was shown that the confusability as measured can be reduced and recognition performance improved stage by stage. All of the above findings were verified by a series of experiments performed on both planned (LDC HUB-4NE) and spontaneous (LDC CALLHOME) Mandarin Chinese speech corpora. © 2006 IEEE.
Subjects
Confusability; Confusion; Multiple-pronunciation dictionary; Pronunciation modeling; Pronunciation variation; Speech recognition
SDGs
Other Subjects
Confusability; Confusion; Multiple-pronunciation dictionary; Pronunciation modeling; Pronunciation variation; Speech analysis; Speech recognition
Type
journal article
File(s)![Thumbnail Image]()
Loading...
Name
20.pdf
Size
1.27 MB
Format
Adobe PDF
Checksum
(MD5):3879a986bbe75f48b757805eca8d4437
