Introducing Semantics into Speech Encoders

Xu, Derek; Dong, Shuyan; Wang, Changhan; Kim, Suyoun; Lin, Zhaojiang; Liu, Bing; Shrivastava, Akshat; Li, Shang Wen; Tseng, Liang Hsuan; Lin, Guan Ting; Baevski, Alexei; HUNG-YI LEE; Sun, Yizhou; Wang, Wei

Introducing Semantics into Speech Encoders

Journal

Proceedings of the Annual Meeting of the Association for Computational Linguistics

Journal Volume

1

ISBN

9781959429722

Date Issued

2023-01-01

Author(s)

Xu, Derek

Dong, Shuyan

Wang, Changhan

Kim, Suyoun

Lin, Zhaojiang

Liu, Bing

Shrivastava, Akshat

Li, Shang Wen

Tseng, Liang Hsuan

Lin, Guan Ting

Baevski, Alexei

HUNG-YI LEE

Sun, Yizhou

Wang, Wei

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/636983

URL

https://api.elsevier.com/content/abstract/scopus_id/85174404904

Abstract

Recent studies find existing self-supervised speech encoders contain primarily acoustic rather than semantic information. As a result, pipelined supervised automatic speech recognition (ASR) to large language model (LLM) systems achieve state-of-the-art results on semantic spoken language tasks by utilizing rich semantic representations from the LLM. These systems come at the cost of labeled audio transcriptions, which is expensive and time-consuming to obtain. We propose a task-agnostic unsupervised way of incorporating semantic information from LLMs into self-supervised speech encoders without labeled audio transcriptions. By introducing semantics, we improve existing speech encoder spoken language understanding (SLU) performance by over 5% on intent classification (IC), with modest gains in named entity resolution (NER) and slot filling (SF), and spoken question answering (SQA) FF1 score by over 2%. Our approach, which uses no ASR data, achieves similar performance as methods trained on over 100 hours of labeled audio transcripts, demonstrating the feasibility of unsupervised semantic augmentations to existing speech encoders.

Type

conference paper

Introducing Semantics into Speech Encoders

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)