Chinese Explicit and Implicit Discourse Analysis
Date Issued
2014
Date
2014
Author(s)
Liao, Wan-Shan
Abstract
In recent years, research in natural language processing, with the study words, phrases levels become more sophisticated. Since the large-scale manually annotated corpus of discourse relations such as PDTB and RST-DT have been released, the study of discourse relation is increasing. If we could correctly predict the relationship between discourse, it will help to understand the semantic understanding. The related applications in natural language processing, such as QA systems, automatic summaries are also of great help.
However, due to the lack of a corpus of Chinese resources, the study in Chinese discourse relations are still little currently.
In this work, we first make a preliminary analysis for HIT-CIR Chinese Discourse Relations Corpus, Harbin Institute of Technology released in 2013. Because of small-scale of datasets, we turn to treat another large-scale pseudo dataset as the training set. Experimental results show that this large-scale corpus training model promote to predict the discourse relation of text from different sources.
Finally, we were further analyzed to the classification performance of implicit and explicit discourse relations, and analyzed whether the non-primary Markers is relevance to its discourse relation.
However, due to the lack of a corpus of Chinese resources, the study in Chinese discourse relations are still little currently.
In this work, we first make a preliminary analysis for HIT-CIR Chinese Discourse Relations Corpus, Harbin Institute of Technology released in 2013. Because of small-scale of datasets, we turn to treat another large-scale pseudo dataset as the training set. Experimental results show that this large-scale corpus training model promote to predict the discourse relation of text from different sources.
Finally, we were further analyzed to the classification performance of implicit and explicit discourse relations, and analyzed whether the non-primary Markers is relevance to its discourse relation.
Subjects
中文語篇關係
顯隱性關係
跨語料庫
語篇標記
Discourse Relation
Type
thesis
File(s)![Thumbnail Image]()
Loading...
Name
ntu-103-R01944023-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):f2e6b5245651b14075ca627d194b7c82