Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and Its Automatic Evaluation
Date Issued
1994
Date
1994
Author(s)
Abstract
To acquire noun phrases from running texts is useful for
many applications, such as word grouping, terminology
indexing, etc. The reported literatures adopt pure
probabilistic approach, or pure rule-based noun phrases
grammar to tackle this problem. In this paper, we apply
a probabilistic chunker to deciding the implicit
boundaries of constituents and utilize the linguistic
knowledge to extract the noun phrases by a finite state
mechanism. The test texts are SUSANNE Corpus and
the results are evaluated by comparing the parse field of
SUSANNE Corpus automatically. The results of this
preliminary experiment are encouraging.
many applications, such as word grouping, terminology
indexing, etc. The reported literatures adopt pure
probabilistic approach, or pure rule-based noun phrases
grammar to tackle this problem. In this paper, we apply
a probabilistic chunker to deciding the implicit
boundaries of constituents and utilize the linguistic
knowledge to extract the noun phrases by a finite state
mechanism. The test texts are SUSANNE Corpus and
the results are evaluated by comparing the parse field of
SUSANNE Corpus automatically. The results of this
preliminary experiment are encouraging.
Publisher
New Mexico
Description
In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (pp. 234-241)
Type
conference paper
File(s)![Thumbnail Image]()
Loading...
Name
ACL1994.pdf
Size
124.98 KB
Format
Adobe PDF
Checksum
(MD5):d1b6a44bfb3d57556c36e176c844a701