https://scholars.lib.ntu.edu.tw/handle/123456789/24966
標題: | A Probabilistic Chunker | 作者: | Chen, Kuang-hua Chen, Hsin-Hsi |
公開日期: | 1993 | 出版社: | Taipei, Taiwan: ROCLING | 起(迄)頁: | 99-117 | 來源出版物: | 6th R.O.C. Computational Linguistics Conference VI | 摘要: | This paper proposes a probabilistic partial parser, which we call chunker. The chunker partitions the input sentence into segments. This idea is motivated by the fact that when we read a sentence, we read it chunk by chunk. We train the chunker from Susanne Corpus, which is a modified but shrinked version of Brown Corpus, underlying bi-gram language model. The experiment is evaluated by outside test and inside test. The preliminary results show the chunker has more than 98% chunk correct rate and 94% sentence correct rate in outside test, and 99% chunk correct rate and 97% sentence correct rate in inside test. The simple but effective chunker design has shown to be promising and can be extended to complete parsing and many applications. © 1993 Proceedings of Rocling 6th Computational Linguistics Conference, ROCLING 1993. All rights reserved. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-79551569515&partnerID=40&md5=28600ba469e02fdc59bc08537619c39a | SDG/關鍵字: | Computational linguistics; Bi-gram language models; Probabilistics; Simple++; Syntactics |
顯示於: | 圖書資訊學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。