https://scholars.lib.ntu.edu.tw/handle/123456789/632410
標題: | Automatic Segmental and Prosodic Labeling of Mandarin Speech Database | 作者: | Chou F.-C Tseng C.-Y LIN-SHAN LEE |
公開日期: | 1998 | 來源出版物: | 5th International Conference on Spoken Language Processing, ICSLP 1998 | 摘要: | In this paper we describe the techniques and methodology developed for automatic labeling of segmental and prosodic information for the Mandarin speech database. There are two major procedures. First, the text is converted into the phonetic network of possible pronunciations, and this network is aligned with the speech data by recognition processes. Secondly, many acoustic prosodic features are derived and the break indices are labeled with these features by decision trees. For the segmental labeling, 96.5% of automatically determined segment boundaries are accurate within a range of 20 ms. For the prosodic labeling, 84.9% of the automatic labeled break indices are the same with the manual labeled one. © 1998. 5th International Conference on Spoken Language Processing, ICSLP 1998. All rights reserved. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84890480439&partnerID=40&md5=cc0877f2bca3a817982dcc01c4ab8366 https://scholars.lib.ntu.edu.tw/handle/123456789/632410 |
SDG/關鍵字: | Character recognition; Speech recognition; Automatic labelling; Break indices; Prosodic features; Prosodic labeling; Prosodics; Recognition process; Segmental labeling; Speech data; Speech database; Decision trees |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。