https://scholars.lib.ntu.edu.tw/handle/123456789/403821
Title: | Iterative machine-learning Chinese term extraction | Authors: | Lee, Chiaming; CHIEN-KANG HUANG; Tang, Kuoming; KUANG-HUA CHEN |
Issue date: | 19-Nov-2012 | Source publication: | Lecture Notes in Computer Science | Abstract: | This paper presents an iterative approach to extracting Chinese terms. Unlike the traditional approach to extracting Chinese terms, which requires the assistance of a dictionary, the proposed approach exploits a Support Vector Machine classifier that learns the extraction rules from the occurrences of a single popular term in the corpus. Additionally, we have designed a very effective feature set and a systematic approach for selecting the positive and negative samples used as training data. An ancient Chinese corpus, Chinese Buddhist Texts, was taken as the experimental corpus. According to our experimental results, the proposed approach achieves very competitive results in comparison with the Chinese Knowledge and Information Processing (CKIP) system from Academia Sinica. © 2012 Springer-Verlag. |
URI: | https://api.elsevier.com/content/abstract/scopus_id/84869032553 https://scholars.lib.ntu.edu.tw/handle/123456789/403821 |
ISBN: | 9783642347511 | ISSN: | 03029743 | DOI: | 10.1007/978-3-642-34752-8_37 |
Appears in collections: | Library internal use only; Department of Library and Information Science |