中文口語處理技術之前瞻性研究課題( 2/3 )
Date Issued
2004-07-31
Date
2004-07-31
Author(s)
DOI
922213E002035
Abstract
The Chinese language is not only spoken by the largest
population in the world, but quite different from many western
languages with a very special structure. It is not alphabetic:
large number of Chinese characters are ideographic symbols and
pronounced as monosyllables. The open vocabulary nature, the
flexible wording structure and the tone behavior are also good
examples within the special structure. It is believed that better
results and performance will be obtainable in developing
Chinese spoken language processing technologies, if this special
structure can be taken into account. In this paper, a set of
“feature units” for Chinese spoken language processing is
identified, and the retrieval, segmentation and summarization of
Chinese spoken documents are taken as examples in analyzing
the use of such “feature units”. Experimental results indicate
that by careful considerations of the special structure and proper
choice of the “feature units”, significantly better performance
can be achieved.
Publisher
臺北市:國立臺灣大學電信工程學研究所
Type
report
File(s)![Thumbnail Image]()
Loading...
Name
922213E002035.pdf
Size
119.37 KB
Format
Adobe PDF
Checksum
(MD5):f1285fbdc0a790a2503cb1809b049158
