What does a network layer hear? analyzing hidden representations of end-to-end asr through speech synthesis

Li, C.-Y.; Yuan, P.-C.; HUNG-YI LEE

doi:10.1109/ICASSP40776.2020.9054675

What does a network layer hear? analyzing hidden representations of end-to-end asr through speech synthesis

Journal

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Journal Volume

2020-May

Pages

6434-6438

Date Issued

2020

Author(s)

Li, C.-Y.

Yuan, P.-C.

HUNG-YI LEE

DOI

10.1109/ICASSP40776.2020.9054675

URI

https://www.scopus.com/inward/record.url?eid=2-s2.0-85091313173&partnerID=40&md5=533f36d93c07c028a7e28a04ad71f224

https://scholars.lib.ntu.edu.tw/handle/123456789/558963

Abstract

End-to-end speech recognition systems have achieved competitive results compared to traditional systems. However, the complex transformations involved between layers given highly variable acoustic signals are hard to analyze. In this paper, we present our ASR probing model, which synthesizes speech from hidden representations of end-to-end ASR to examine the information maintained after each layer calculation. Listening to the synthesized speech, we observe gradual removal of speaker variability and noise as the layer goes deeper, which aligns with the previous studies on how deep network functions in speech recognition. This paper is the first study analyzing the end-to-end speech recognition model by demonstrating what each layer hears. Speaker verification and speech enhancement measurements on synthesized speech are also conducted to confirm our observation further. © 2020 Institute of Electrical and Electronics Engineers Inc.. All rights reserved.

Subjects

Analysis; Automatic speech recognition; End-toend; Interpretability

Type

conference paper

What does a network layer hear? analyzing hidden representations of end-to-end asr through speech synthesis

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)