A Semantics-Enhanced Language Model for Unsupervised Word Sense Disambiguation
Resource
CICLing 2008: 287-298
Journal
CICLing 2008
Pages
287-298
Date Issued
2008
Author(s)
Verspoor, Karin
Abstract
An N-gram language model aims to capture statistical word-order dependency information from corpora. Although language models have been applied extensively to a variety of NLP problems with reasonable success, the standard model does not incorporate semantic information, which limits its applicability to semantic problems such as word sense disambiguation. We propose a framework that integrates semantic information into the language model schema, allowing a system to exploit both syntactic and semantic information to address NLP problems. Furthermore, acknowledging the limited availability of semantically annotated data, we discuss how the proposed model can be learned without annotated training examples.
Finally, we report on a case study showing how the semantics-enhanced language model can be applied to unsupervised word sense disambiguation with promising results.
Type
conference paper
File(s)
Name
CICLING08.pdf
Size
23.23 KB
Format
Adobe PDF
Checksum
(MD5): d6d6c69f0c4d0716022f84f468af53a9
