One shot learning for speech separation

Wu Y.-K; Huang K.-P; Tsao Y; Lee H.-Y.

doi:10.1109/ICASSP39728.2021.9413956

Journal

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Journal Volume

2021-June

Pages

5769-5773

Date Issued

2021

Author(s)

Wu Y.-K

Huang K.-P

Tsao Y

HUNG-YI LEE

DOI

10.1109/ICASSP39728.2021.9413956

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85115144481&doi=10.1109%2fICASSP39728.2021.9413956&partnerID=40&md5=3e90b74bb231d719552774e444544ec9

https://scholars.lib.ntu.edu.tw/handle/123456789/607155

Abstract

Despite their recent success, speech separation models fail to separate sources properly when facing new sets of speakers or noisy environments. To tackle this problem, we propose applying meta-learning to the speech separation task. We aim to find a meta-initialization from which the model can quickly adapt to new speakers after seeing only one mixture generated by those speakers. In this paper, we apply the model-agnostic meta-learning (MAML) algorithm and the almost no inner loop (ANIL) algorithm to Conv-TasNet to achieve this goal. The experimental results show that our model can adapt not only to a new set of speakers but also to noisy environments. Furthermore, we find that the encoder and decoder serve as feature-reuse layers, while the separator is the task-specific module. © 2021 IEEE
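The difference between the two inner-loop strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the three scalar "modules", the squared-error loss, the learning rate, and the helper names `loss_and_grads` and `inner_adapt` are all assumptions made for illustration, and only a single inner-loop adaptation step is shown (the outer meta-update is omitted). The choice to adapt only the separator in the ANIL-style call is likewise an assumption, motivated by the abstract's remark that the separator is the task-specific module.

```python
import numpy as np

def loss_and_grads(params, x, target):
    """Toy model: pred = decoder * separator * encoder * x (all scalars).

    Stands in for a real separation network; the squared-error loss is an
    assumption and is not the objective used in the paper.
    """
    e, s, d = params["encoder"], params["separator"], params["decoder"]
    err = d * s * e * x - target
    loss = float(np.mean(err ** 2))
    g = 2.0 * float(np.mean(err * x))  # gradient w.r.t. the product e*s*d
    grads = {"encoder": g * s * d, "separator": g * e * d, "decoder": g * e * s}
    return loss, grads

def inner_adapt(params, x, target, lr, adapt_keys):
    """One inner-loop gradient step on a single support example.

    Only the modules listed in adapt_keys are updated; the rest are reused
    unchanged from the meta-initialization.
    """
    _, grads = loss_and_grads(params, x, target)
    return {k: (v - lr * grads[k]) if k in adapt_keys else v
            for k, v in params.items()}

# Hypothetical meta-initialization and a single "mixture" from a new task.
meta_init = {"encoder": 1.0, "separator": 1.0, "decoder": 1.0}
x = np.array([1.0, 2.0, 3.0])
target = 2.0 * x

# MAML-style inner step: every module is adapted.
maml_params = inner_adapt(meta_init, x, target, lr=0.01,
                          adapt_keys=set(meta_init))
# ANIL-style inner step: only the (assumed) task-specific module is adapted.
anil_params = inner_adapt(meta_init, x, target, lr=0.01,
                          adapt_keys={"separator"})

loss_before, _ = loss_and_grads(meta_init, x, target)
loss_maml, _ = loss_and_grads(maml_params, x, target)
loss_anil, _ = loss_and_grads(anil_params, x, target)
```

In this toy setting both adapted parameter sets lower the loss on the support example, while the ANIL-style call leaves the encoder and decoder at their meta-initialized values.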

Subjects

ANIL

MAML

Meta-learning

Speech separation

Separation

Speech analysis

Feature reuse

Inner loops

Metalearning

Noisy environment

One-shot learning

Task-specific modules

Use-model

Source separation

Type

conference paper
