End-to-End Whispered Speech Recognition with Frequency-Weighted Approaches and Pseudo Whisper Pre-training

Chang H.-J;Liu A.H;Lee H.-Y;Lee L.-S.

標題:	End-to-End Whispered Speech Recognition with Frequency-Weighted Approaches and Pseudo Whisper Pre-training
作者:	Chang H.-J Liu A.H Lee H.-Y HUNG-YI LEE LIN-SHAN LEE
關鍵字:	Speech; Transfer learning; Feature extractor; Frequency weighted; High frequency HF; Human speech; Layer-wise; Pre-training; Relative reduction; Whispered speech; Speech recognition
公開日期:	2021
起(迄)頁:	186-193
來源出版物:	2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings
摘要:	Whispering is an important mode of human speech, but no end-to-end recognition results for it were reported yet, probably due to the scarcity of available whispered speech data. In this paper, we present several approaches for end-to-end (E2E) recognition of whispered speech considering the special characteristics of whispered speech and the scarcity of data. This includes a frequency-weighted SpecAugment policy and a frequency-divided CNN feature extractor for better capturing the high-frequency structures of whispered speech, and a layer-wise transfer learning approach to pre-train a model with normal or normal-to-whispered converted speech then fine-tune it with whispered speech to bridge the gap between whispered and normal speech. We achieve an overall relative reduction of 19.8% in PER and 44.4% in CER on a relatively small whispered TIMIT corpus. The results indicate as long as we have a good E2E model pre-trained on normal or pseudo-whispered speech, a relatively small set of whispered speech may suffice to obtain a reasonably good E2E whispered speech recognizer. ? 2021 IEEE.
URI:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103924610&doi=10.1109%2fSLT48900.2021.9383595&partnerID=40&md5=5eee06475284ca9a7e017c50c81ca4ec https://scholars.lib.ntu.edu.tw/handle/123456789/580906
DOI:	10.1109/SLT48900.2021.9383595
顯示於：	電機工程學系

顯示文件完整紀錄

SCOPUS^TM
Citations

checked on 2023/12/27

Page view(s)

checked on 2024/4/20

Google Scholar^TM

檢查

Altmetric

TAIR相關文章

SCOPUSTM Citations

Page view(s)

Google ScholarTM

Altmetric

Altmetric

SCOPUS^TM
Citations

Google Scholar^TM