Semantic Clustering for the Constituent Morphemes of Chinese Disyllabic Compounds
Date Issued
2014
Date
2014
Author(s)
Lee, Chia-Ling
Abstract
Many linguists hold that morphological awareness strongly affects children's reading development. A Chinese character embedded in different compound words may carry different meanings. In this work, we aim to semantically cluster a given family of morphologically related Chinese words. For example, "商店 (store)", "商品 (commodity)", "商代 (Shang period)", and "商朝 (Shang Dynasty)" can form two clusters: {"商店", "商品"} and {"商代", "商朝"}. In terms of the meaning of the character "商/shang1/", the former cluster carries information about commerce, and the latter conveys concepts about a Chinese dynasty. We combine computational linguistics methods into an ensemble model that takes contextual, semantic, syntactic, lexical, and statistical factors into consideration. For comparison, we conduct a human experiment in which adults and children perform the same clustering task. Experimental results indicate that our ensemble model achieves a level of performance similar to that of children.
Subjects
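Morphological awareness (詞素覺識)
Semantic clustering (語意分群)
Natural language processing (自然語言處理)
Computational linguistics (計算語言)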
Type
thesis
File(s)
Name
ntu-103-R00922072-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):0588186814f3e141b68fd97e4fc1ddec