https://scholars.lib.ntu.edu.tw/handle/123456789/498661
標題: | Improved tone recognition for fluent Mandarin speech based on new inter-syllabic features and robust pitch extraction | 作者: | Lin, W.-Y. LIN-SHAN LEE |
公開日期: | 2003 | 起(迄)頁: | 237-242 | 來源出版物: | 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003 | 會議論文: | IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003 | 摘要: | Tone recognition for fluent Mandarin speech has always been a very difficult problem, because the pitch contours vary seriously with the context conditions and the complicated tone behavior is difficult to analyze. In this paper, a new set of four inter-syllabic features are identified to characterize quantitatively such pitch contour variation with respect to the context conditions. In addition, a robust pitch extraction method is proposed by integrating the Adaptive Gabor Representation (AGR) and Instantaneous Frequency Amplitude Spectrum (IFAS). Experimental results indicate that accurate pitch values can be extracted under various noisy conditions, and the tone recognition accuracy can be improved significantly. © 2003 IEEE. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/498661 https://www.scopus.com/inward/record.uri?eid=2-s2.0-34547546161&doi=10.1109%2fASRU.2003.1318447&partnerID=40&md5=3d067ad24336661620aaeb74b0104dc4 |
DOI: | 10.1109/ASRU.2003.1318447 | SDG/關鍵字: | Continuous speech recognition; Extraction; Instantaneous frequency amplitude spectrum; Noisy conditions; Pitch contours; Pitch extraction; Pitch values; Tone recognition; Speech recognition |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。