https://scholars.lib.ntu.edu.tw/handle/123456789/24966
Title: | A Probabilistic Chunker | Authors: | Chen, Kuang-hua Chen, Hsin-Hsi |
Issue Date: | 1993 | Publisher: | Taipei, Taiwan: ROCLING | Start page/Pages: | 99-117 | Source: | 6th R.O.C. Computational Linguistics Conference VI | Abstract: | This paper proposes a probabilistic partial parser, which we call chunker. The chunker partitions the input sentence into segments. This idea is motivated by the fact that when we read a sentence, we read it chunk by chunk. We train the chunker from Susanne Corpus, which is a modified but shrinked version of Brown Corpus, underlying bi-gram language model. The experiment is evaluated by outside test and inside test. The preliminary results show the chunker has more than 98% chunk correct rate and 94% sentence correct rate in outside test, and 99% chunk correct rate and 97% sentence correct rate in inside test. The simple but effective chunker design has shown to be promising and can be extended to complete parsing and many applications. © 1993 Proceedings of Rocling 6th Computational Linguistics Conference, ROCLING 1993. All rights reserved. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-79551569515&partnerID=40&md5=28600ba469e02fdc59bc08537619c39a | SDG/Keyword: | Computational linguistics; Bi-gram language models; Probabilistics; Simple++; Syntactics |
Appears in Collections: | 圖書資訊學系 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.