DON'T SPEAK TOO FAST: THE IMPACT OF DATA BIAS ON SELF-SUPERVISED SPEECH MODELS

HUNG-YI LEE

doi:10.1109/ICASSP43922.2022.9747897

Publication:
DON'T SPEAK TOO FAST: THE IMPACT OF DATA BIAS ON SELF-SUPERVISED SPEECH MODELS

cris.lastimport.scopus	2025-05-13T22:37:23Z
cris.virtual.department	Electrical Engineering	en_US
cris.virtual.department	Intel-NTU Connected Context Computing Center	en_US
cris.virtual.department	Communication Engineering	en_US
cris.virtual.department	Computer Science and Information Engineering	en_US
cris.virtual.department	Networking and Multimedia	en_US
cris.virtual.department	Center for Artificial Intelligence and Advanced Robotics	en_US
cris.virtual.department	Master's Program in Smart Medicine and Health Informatics (SMARTMHI)	en_US
cris.virtual.orcid	0000-0002-9654-5747	en_US
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.department	0897e0f8-f71a-40d3-a313-62f0c81793df
cris.virtualsource.orcid	0897e0f8-f71a-40d3-a313-62f0c81793df
dc.contributor.author	Meng, Yen	en_US
dc.contributor.author	Chou, Yi Hui	en_US
dc.contributor.author	Liu, Andy T.	en_US
dc.contributor.author	HUNG-YI LEE	en_US
dc.date.accessioned	2023-04-20T10:05:07Z
dc.date.available	2023-04-20T10:05:07Z
dc.date.issued	2022-01-01
dc.description.abstract	Self-supervised Speech Models (S3Ms) have been proven successful in many speech downstream tasks, like ASR. However, how pretraining data affects S3Ms' downstream behavior remains an unexplored issue. In this paper, we study how pre-training data affects S3Ms by pre-training models on biased datasets targeting different factors of speech, including gender, content, and prosody, and evaluate these pre-trained S3Ms on selected downstream tasks in SUPERB Benchmark. Our experiments show that S3Ms have tolerance toward gender bias. Moreover, we find that the content of speech has little impact on the performance of S3Ms across downstream tasks, but S3Ms do show a preference toward a slower speech rate.	en_US
dc.identifier.doi	10.1109/ICASSP43922.2022.9747897
dc.identifier.isbn	9781665405409
dc.identifier.issn	15206149
dc.identifier.scopus	2-s2.0-85133024721
dc.identifier.uri	https://scholars.lib.ntu.edu.tw/handle/123456789/630389
dc.identifier.url	https://api.elsevier.com/content/abstract/scopus_id/85133024721
dc.relation.ispartof	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings	en_US
dc.relation.journalvolume	2022-May	en_US
dc.relation.pageend	3262	en_US
dc.subject	Data Bias \| Self-supervised Speech Models \| SUPERB Benchmark	en_US
dc.title	DON'T SPEAK TOO FAST: THE IMPACT OF DATA BIAS ON SELF-SUPERVISED SPEECH MODELS	en_US
dc.type	conference paper
dspace.entity.type	Publication

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Electrical Engineering / 電機工程學系

Publication:
DON'T SPEAK TOO FAST: THE IMPACT OF DATA BIAS ON SELF-SUPERVISED SPEECH MODELS

Files

License bundle

Collections

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)

Publication: DON'T SPEAK TOO FAST: THE IMPACT OF DATA BIAS ON SELF-SUPERVISED SPEECH MODELS

Files

License bundle

Collections

Publication:
DON'T SPEAK TOO FAST: THE IMPACT OF DATA BIAS ON SELF-SUPERVISED SPEECH MODELS