Iterative machine-learning Chinese term extraction

標題:	Iterative machine-learning Chinese term extraction
作者:	Lee, Chiaming CHIEN-KANG HUANG Tang, Kuoming KUANG-HUA CHEN
公開日期:	19-十一月-2012
來源出版物:	Lecture Notes in Computer Science
摘要:	This paper presents an iterative approach to extracting Chinese terms. Unlike the traditional approach to extracting Chinese terms, which requires the assistance of a dictionary, the proposed approach exploits the Support Vector Machine classifier which learns the extraction rules from the occurrences of a single popular term in the corpus. Additionally, we have designed a very effective feature set and a systematic approach for selecting the positive and negative samples as the source of training. An ancient Chinese corpus, Chinese Buddhist Texts, was taken as the experiment corpus. According to our experiment results, the proposed approach can achieve a very competitive result in comparison with the Chinese Knowledge and Information Processing (CKIP) system from Academia Sinica. © 2012 Springer-Verlag.
URI:	https://api.elsevier.com/content/abstract/scopus_id/84869032553 https://scholars.lib.ntu.edu.tw/handle/123456789/403821
ISBN:	9783642347511
ISSN:	03029743
DOI:	10.1007/978-3-642-34752-8_37
顯示於：	圖書館內部專用圖書資訊學系

顯示文件完整紀錄

checked on 2023/11/13

165

checked on 2024/5/4

檢查

TAIR相關文章