Linear Classifier: An Often-Forgotten Baseline for Text Classification

ANGELA YU-CHEN LIN; Chen, Si An; Liu, Jie Jyun; CHIH-JEN LIN

doi:10.18653/v1/2023.acl-short.160

Linear Classifier: An Often-Forgotten Baseline for Text Classification

Journal

Proceedings of the Annual Meeting of the Association for Computational Linguistics

Journal Volume

2

ISBN

9781959429715

Date Issued

2023-01-01

Author(s)

ANGELA YU-CHEN LIN

Chen, Si An

Liu, Jie Jyun

CHIH-JEN LIN

DOI

10.18653/v1/2023.acl-short.160

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/636595

URL

https://api.elsevier.com/content/abstract/scopus_id/85172233514

Abstract

Large-scale pre-trained language models such as BERT are popular solutions for text classification. Due to the superior performance of these advanced methods, nowadays, people often directly train them for a few epochs and deploy the obtained model. In this opinion paper, we point out that this way may only sometimes get satisfactory results. We argue the importance of running a simple baseline like linear classifiers on bag-of-words features along with advanced methods. First, for many text data, linear methods show competitive performance, high efficiency, and robustness. Second, advanced models such as BERT may only achieve the best results if properly applied. Simple baselines help to confirm whether the results of advanced models are acceptable. Our experimental results fully support these points.

SDGs

[SDGs]SDG4

Type

conference paper

Linear Classifier: An Often-Forgotten Baseline for Text Classification

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)