https://scholars.lib.ntu.edu.tw/handle/123456789/413096
Title: | Detecting word usage errors in Chinese sentences for learning Chinese as a foreign language | Authors: | Shiue Y.-T.; Chen H.-H. |
Keywords: | Grammatical error detection;HSK corpus;Second language learning | Issue Date: | 2016 | Start/End Page: | 220-224 | Source: | 10th International Conference on Language Resources and Evaluation, LREC 2016 | Abstract: | Automated grammatical error detection, which helps users improve their writing, is an important application in NLP. As more and more people learn Chinese, an automated error detection system can help these learners. This paper proposes n-gram features, dependency count features, dependency bigram features, and single-character features to determine whether a Chinese sentence contains word usage errors, that is, errors in which a word is written in a wrong form or the word choice is inappropriate. Because potential errors are marked at the level of sentence segments, typically delimited by punctuation marks, the learner can try to correct the problems without the assistance of a language teacher. Experiments on the HSK corpus show that the classifier combining all sets of features achieves an accuracy of 0.8423. By using certain combinations of the feature sets, we can construct a system that favours either precision or recall. The best precision we achieve is 0.9536, indicating that our system is reliable and seldom produces misleading results. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/413096 | ISBN: | 9782951740891 |
Appears in Collections: | Department of Computer Science and Information Engineering |
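
The abstract mentions marking errors at the level of sentence segments delimited by punctuation marks, and using n-gram features. The following is a minimal sketch of those two ideas only, not the authors' implementation. The delimiter set, the function names `split_segments` and `char_ngrams`, and the choice of character n-grams (rather than n-grams over segmented words) are all assumptions made for illustration.

```python
import re

# Assumed delimiter set: common Chinese and ASCII punctuation marks.
# The record does not specify which marks the authors actually use.
SEGMENT_DELIMITERS = r"[，。！？；：,.!?;:]"


def split_segments(sentence):
    """Split a sentence into segments at punctuation marks, dropping empty pieces."""
    pieces = re.split(SEGMENT_DELIMITERS, sentence)
    return [piece.strip() for piece in pieces if piece.strip()]


def char_ngrams(segment, n):
    """Return all character n-grams of a segment (hypothetical feature unit)."""
    return [segment[i:i + n] for i in range(len(segment) - n + 1)]


def segment_features(sentence, n=2):
    """Pair each segment with its character n-grams, preserving segment order."""
    return [(seg, char_ngrams(seg, n)) for seg in split_segments(sentence)]


if __name__ == "__main__":
    for segment, ngrams in segment_features("我喜欢学习中文，但是很难。"):
        print(segment, ngrams)
```

In a full system, each segment's features would be passed to a classifier that labels the segment as correct or erroneous; that step is omitted here because the record gives no details about it.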