https://scholars.lib.ntu.edu.tw/handle/123456789/116714
Title: | Improved Summarization of Chinese Spoken Documents by Probabilistic Latent Semantic Analysis (PLSA) with Further Analysis and Integrated Scoring | Authors: | Sheng-yi Kong LIN-SHAN LEE |
Keywords: | Probabilistic latent semantic analysis; Spoken document; Summarization | Issue Date: | 2006 | Start page/Pages: | 26-29 | Source: | 2006 IEEE ACL Spoken Language Technology Workshop, SLT 2006, Proceedings | Conference: | 2006 IEEE ACL Spoken Language Technology Workshop, SLT 2006 | Abstract: | In a previous paper [1] two new scoring measures, Topic Significance (TS) and Topic Entropy (TE), obtained from Probabilistic Latent Semantic Analysis (PLSA) were shown to outperform very successful baseline Significance Score (SS) in selecting the important sentences for summarization of spoken documents. In this paper extensive experiments using the ROUGE scores with respect to different parameters at different summarization ratios were carefully analyzed in great detail. It was also found that integration of these two scoring measures offered further improvements, and special considerations of the structure of Chinese language was also helpful when summarizing Chinese spoken documents. ©2006 IEEE. |
Description: | Aruba |
URI: | http://ntur.lib.ntu.edu.tw//handle/246246/220185 http://ntur.lib.ntu.edu.tw/bitstream/246246/220185/-1/25.pdf https://www.scopus.com/inward/record.uri?eid=2-s2.0-48749112591&doi=10.1109%2fSLT.2006.326808&partnerID=40&md5=5d4890a59d56eaa24fddfbfb310f6643 |
DOI: | 10.1109/SLT.2006.326808 | SDG/Keyword: | Image retrieval; Information theory; Learning systems; Probability; Semantics; Chinese language; Probabilistic latent semantic analysis (PLSA); Scoring measures; Spoken documents; Spoken languages; Linguistics; Semantics; Chinese language; Probabilistic latent semantic analysis; Scoring measures; Spoken document; Summarization |
Appears in Collections: | 資訊工程學系 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.