Gender Bias in Instruction-Guided Speech Synthesis Models

Kuan, Chun-Yi; Lee, Hung-yi

doi:10.18653/v1/2025.findings-naacl.298

Gender Bias in Instruction-Guided Speech Synthesis Models

Journal

2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Proceedings of the Conference Findings, NAACL 2025

Start Page

5387

End Page

5413

ISBN (of the container)

979-889176195-7

Date Issued

2025-04-29

Author(s)

Kuan, Chun-Yi

Lee, Hung-yi

DOI

10.18653/v1/2025.findings-naacl.298

URI

https://www.scopus.com/record/display.uri?eid=2-s2.0-105028681053&origin=resultslist

https://scholars.lib.ntu.edu.tw/handle/123456789/736824

Abstract

Recent advancements in controllable expressive speech synthesis, especially in text-to-speech (TTS) models, have allowed for the generation of speech with specific styles guided by textual descriptions, known as style prompts. While this development enhances the flexibility and naturalness of synthesized speech, there remains a significant gap in understanding how these models handle vague or abstract style prompts. This study investigates the potential gender bias in how models interpret occupation-related prompts, specifically examining their responses to instructions like “Act like a nurse”. We explore whether these models exhibit tendencies to amplify gender stereotypes when interpreting such prompts. Our experimental results reveal the model’s tendency to exhibit gender bias for certain occupations. Moreover, models of different sizes show varying degrees of this bias across these occupations.

Event(s)

2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, NAACL 2025

Publisher

Association for Computational Linguistics

Type

conference paper

Gender Bias in Instruction-Guided Speech Synthesis Models

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)