Boosting Self-Supervised Embeddings for Speech Enhancement

Hung, Kuo Hsuan; Fu, Szu Wei; Tseng, Huan Hsin; Chiang, Hsin Tien; Tsao, Yu; CHII-WANN LIN

標題:	Boosting Self-Supervised Embeddings for Speech Enhancement
作者:	Hung, Kuo Hsuan Fu, Szu Wei Tseng, Huan Hsin Chiang, Hsin Tien Tsao, Yu CHII-WANN LIN
關鍵字:	cross-domain feature \| noise robustness \| Self-supervised learning
公開日期:	1-一月-2022
卷:	2022-September
來源出版物:	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
摘要:	Self-supervised learning (SSL) representation for speech has achieved state-of-the-art (SOTA) performance on several downstream tasks. However, there remains room for improvement in speech enhancement (SE) tasks. In this study, we used a cross-domain feature to solve the problem that SSL embeddings may lack fine-grained information to regenerate speech signals. By integrating the SSL representation and spectrogram, the result can be significantly boosted. We further study the relationship between the noise robustness of SSL representation via clean-noisy distance (CN distance) and the layer importance for SE. Consequently, we found that SSL representations with lower noise robustness are more important. Furthermore, our experiments on the VCTK-DEMAND dataset demonstrated that fine-tuning an SSL representation with an SE model can outperform the SOTA SSL-based SE methods in PESQ, CSIG and COVL without invoking complicated network architectures. In later experiments, the CN distance in SSL embeddings was observed to increase after fine-tuning. These results verify our expectations and may help design SE-related SSL training in the future.
URI:	https://scholars.lib.ntu.edu.tw/handle/123456789/633655
ISSN:	2308457X
DOI:	10.21437/Interspeech.2022-10002
顯示於：	電機工程學系

顯示文件完整紀錄

SCOPUS^TM
Citations

checked on 2023/11/25

Page view(s)

checked on 2024/4/27

Google Scholar^TM

檢查

Altmetric

TAIR相關文章

SCOPUSTM Citations

Page view(s)

Google ScholarTM

Altmetric

Altmetric

SCOPUS^TM
Citations

Google Scholar^TM