Speech-IFEval: Evaluating Instruction-Following and Quantifying Catastrophic Forgetting in Speech-Aware Language Models

Lu, Ke-Han; Kuan, Chun-Yi; Lee, Hung-yi

doi:10.21437/interspeech.2025-619

Speech-IFEval: Evaluating Instruction-Following and Quantifying Catastrophic Forgetting in Speech-Aware Language Models

Journal

Interspeech 2025

Series/Report No.

Proceedings of the Annual Conference of the International Speech Communication Association Interspeech

Start Page

2078

End Page

2082

ISSN

2308457X

Date Issued

2025-08-17

Author(s)

Lu, Ke-Han

Kuan, Chun-Yi

Lee, Hung-yi

DOI

10.21437/interspeech.2025-619

URI

https://www.scopus.com/record/display.uri?eid=2-s2.0-105020071639&origin=resultslist

https://scholars.lib.ntu.edu.tw/handle/123456789/734026

Abstract

We introduce Speech-IFEval, an evaluation framework designed to assess instruction-following capabilities and quantify catastrophic forgetting in speech-aware language models (SLMs). Recent SLMs integrate speech perception with large language models (LLMs), often degrading textual capabilities due to speech-centric training. Existing benchmarks conflate speech perception with instruction-following, hindering evaluation of these distinct skills. To address this gap, we provide a benchmark for diagnosing the instruction-following abilities of SLMs. Our findings show that most SLMs struggle with even basic instructions, performing far worse than text-based LLMs. Additionally, these models are highly sensitive to prompt variations, often yielding inconsistent and unreliable outputs. We highlight core challenges and provide insights to guide future research, emphasizing the need for evaluation beyond task-level metrics.

Event(s)

26th Interspeech Conference 2025

Subjects

evaluation benchmarks

instruction following

speech-aware language model

SDGs

[SDGs]SDG10

Publisher

ISCA

Type

conference paper

Speech-IFEval: Evaluating Instruction-Following and Quantifying Catastrophic Forgetting in Speech-Aware Language Models

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)