Pronunciation Modeling with Reduced Confusion for Mandarin Chinese Using A Three-stage Framework

Tsai, M.-Y.; Chou, F.-C.; Lee, L.-S.

doi:10.1109/TASL.2006.876769

Resource

IEEE Transactions on Audio, Speech, and Language Processing 15 (2): 661-675

Journal

IEEE Transactions on Audio, Speech, and Language Processing

Journal Volume

15

Journal Issue

2

Pages

661-675

Date Issued

2007

Date

2007

Author(s)

Tsai, M.-Y.

Chou, F.-C.

Lee, Lin-Shan

DOI

10.1109/TASL.2006.876769

URI

http://ntur.lib.ntu.edu.tw//handle/246246/142104

https://www.scopus.com/inward/record.uri?eid=2-s2.0-60849085902&doi=10.1109%2fTASL.2006.876769&partnerID=40&md5=f30b2e451eeb362694b53aeb449a3f18

Abstract

Multiple-pronunciation dictionaries have been found useful in pronunciation modeling for speech recognition. However, the extra pronunciation variants added to the dictionary inevitably increase the confusion among different words during recognition, and consequently limit the achievable improvement in recognition performance. This paper proposes a three-stage framework for Mandarin Chinese that automatically constructs the multiple-pronunciation dictionary while reducing the confusion this may cause. The proposed framework comprises pronunciation generation (Stage 1), ranking (Stage 2), and pruning (Stage 3). New measures of confusability for multiple-pronunciation dictionaries were developed and shown to correlate very strongly with recognition performance. With the proposed framework, the measured confusability was reduced and recognition performance improved stage by stage. All of these findings were verified by a series of experiments on both planned (LDC HUB-4NE) and spontaneous (LDC CALLHOME) Mandarin Chinese speech corpora. © 2006 IEEE.
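The generate, rank, and prune flow named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the phone-substitution rules, the score table, and above all the toy confusability measure (a count of pronunciations shared by more than one word), which stands in for the paper's own measures that this record does not describe.

```python
from collections import defaultdict

def generate(base_lexicon, phone_rules):
    """Stage 1 (hypothetical): propose variants by substituting single phones.
    The canonical pronunciation always stays first in each list."""
    candidates = {}
    for word, canonical in base_lexicon.items():
        variants = [canonical]
        for src, dst in phone_rules:
            if src in canonical:
                variant = tuple(dst if p == src else p for p in canonical)
                if variant not in variants:
                    variants.append(variant)
        candidates[word] = variants
    return candidates

def rank(candidates, scores):
    """Stage 2 (hypothetical): order non-canonical variants by a score,
    highest first. Missing scores default to 0.0."""
    ranked = {}
    for word, variants in candidates.items():
        canonical, extras = variants[0], variants[1:]
        extras = sorted(extras, key=lambda p: scores.get((word, p), 0.0),
                        reverse=True)
        ranked[word] = [canonical] + extras
    return ranked

def confusability(lexicon):
    """Toy measure (an assumption, not the paper's): the number of
    pronunciations that belong to more than one word."""
    owners = defaultdict(set)
    for word, prons in lexicon.items():
        for p in prons:
            owners[p].add(word)
    return sum(1 for words in owners.values() if len(words) > 1)

def prune(ranked, max_variants=1):
    """Stage 3 (hypothetical): keep the canonical form, then up to
    max_variants extras that do not equal another word's canonical form."""
    canonical_owner = {prons[0]: word for word, prons in ranked.items()}
    pruned = {}
    for word, prons in ranked.items():
        kept = [prons[0]]
        for p in prons[1:]:
            if len(kept) - 1 >= max_variants:
                break
            if canonical_owner.get(p, word) != word:
                continue  # collides with another word's canonical form
            kept.append(p)
        pruned[word] = kept
    return pruned

# Illustrative three-entry lexicon; phone symbols and rules are made up.
base = {
    "shi4": ("sh", "ih4"),
    "si4": ("s", "ih4"),
    "zhi1": ("zh", "ih1"),
}
rules = [("sh", "s"), ("zh", "z")]

ranked = rank(generate(base, rules), scores={})
pruned = prune(ranked)
print(confusability(ranked), confusability(pruned))
```

In this toy lexicon the generated variant of "shi4" coincides with the canonical form of "si4", so pruning drops it, while the variant of "zhi1" collides with nothing and is kept. A real system would derive variants and scores from speech data rather than from hand-written rules.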

Subjects

Confusability; Confusion; Multiple-pronunciation dictionary; Pronunciation modeling; Pronunciation variation; Speech recognition

SDGs

[SDGs]SDG4

Other Subjects

Confusability; Confusion; Multiple-pronunciation dictionary; Pronunciation modeling; Pronunciation variation; Speech analysis; Speech recognition

Type

journal article

File(s)

Name

20.pdf

Size

1.27 MB

Format

Adobe PDF

Checksum

(MD5):3879a986bbe75f48b757805eca8d4437
