Preserving Zero-shot Capability in Supervised Fine-tuning for Multi-label Text Classification

Chen, Si-An; Lin, Hsuan-Tien; Lin, Chih-Jen

doi:10.18653/v1/2025.findings-naacl.315

Journal

2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Proceedings of the Conference Findings, NAACL 2025

Start Page

5714

End Page

5727

ISBN (of the container)

979-889176195-7

Date Issued

2025-04

Author(s)

Chen, Si-An

Lin, Hsuan-Tien

Lin, Chih-Jen

DOI

10.18653/v1/2025.findings-naacl.315

URI

https://www.scopus.com/record/display.uri?eid=2-s2.0-105028709687&origin=resultslist

https://scholars.lib.ntu.edu.tw/handle/123456789/737206

Abstract

Zero-shot multi-label text classification (ZMTC) requires models to predict multiple labels for a document, including labels unseen during training. Previous work assumes that leveraging label descriptions ensures zero-shot capability. However, we find that supervised methods, despite achieving strong overall performance, lose their zero-shot capability during training, revealing a trade-off between overall and zero-shot performance. To address this issue, we propose OF-DE and OF-LAN, which preserve the zero-shot capability of powerful dual encoder and label-wise attention network architectures by freezing the label encoder. Additionally, we introduce a self-supervised auxiliary loss to further improve zero-shot performance. Experiments demonstrate that our approach significantly improves the zero-shot performance of supervised methods while maintaining strong overall accuracy.
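
The idea of freezing the label encoder in a dual encoder can be illustrated with a small sketch. This is not the authors' OF-DE implementation: the class `FrozenLabelDualEncoder`, the hand-written label vectors, the linear document encoder, and the toy training data are all assumptions made for illustration, and the self-supervised auxiliary loss is not shown. The sketch only demonstrates that when label vectors stay fixed and only the document encoder is updated, a label never seen in training can still be scored in the same embedding space.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class FrozenLabelDualEncoder:
    """Toy dual encoder (hypothetical): trainable linear document encoder,
    frozen label vectors standing in for a frozen label encoder."""

    def __init__(self, label_vecs, in_dim, seed=0):
        rng = random.Random(seed)
        # Label vectors are stored as tuples and never updated (frozen).
        self.label_vecs = {k: tuple(v) for k, v in label_vecs.items()}
        dim = len(next(iter(self.label_vecs.values())))
        # Trainable document-encoder weights, shape (dim, in_dim).
        self.W = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)]
                  for _ in range(dim)]

    def encode_doc(self, x):
        return [dot(row, x) for row in self.W]

    def score(self, x, label):
        # Relevance of a label = sigmoid of the document/label dot product.
        return sigmoid(dot(self.encode_doc(x), self.label_vecs[label]))

    def train_step(self, x, positives, seen_labels, lr=0.5):
        # One gradient step of binary cross-entropy over the seen labels.
        # Only W changes; the label vectors receive no update.
        z = self.encode_doc(x)
        grad_z = [0.0] * len(z)
        for label in seen_labels:
            e = self.label_vecs[label]
            err = sigmoid(dot(z, e)) - (1.0 if label in positives else 0.0)
            for i in range(len(z)):
                grad_z[i] += err * e[i]
        for i, row in enumerate(self.W):
            for j in range(len(row)):
                row[j] -= lr * grad_z[i] * x[j]


# Hypothetical label vectors; "athletics" is never used during training.
label_vecs = {
    "sports": (1.0, 0.0),
    "politics": (0.0, 1.0),
    "athletics": (0.9, 0.1),
}
frozen_snapshot = dict(label_vecs)

model = FrozenLabelDualEncoder(label_vecs, in_dim=3)
sports_doc = [1.0, 0.0, 0.0]     # toy document features
politics_doc = [0.0, 1.0, 0.0]
seen = ["sports", "politics"]

for _ in range(200):
    model.train_step(sports_doc, {"sports"}, seen)
    model.train_step(politics_doc, {"politics"}, seen)

unseen_on_sports = model.score(sports_doc, "athletics")
unseen_on_politics = model.score(politics_doc, "athletics")
```

In this toy setting the unseen label "athletics" scores higher on the sports document than on the politics document, because its frozen vector still lies close to "sports". If the label vectors were also trained on the seen labels only, that geometric relationship would no longer be guaranteed.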

Event(s)

2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, NAACL 2025

Publisher

Association for Computational Linguistics

Type

conference paper
