https://scholars.lib.ntu.edu.tw/handle/123456789/636753
標題: | Using Co-word Network Community Detection and LDA Topic Modeling to Extract Topics in TED Talks | 作者: | Hung, Li Ting MUH-CHYUN TANG Lin, Sung Chien |
關鍵字: | Co-word analysis | LDA topic modeling | TED Talk | 公開日期: | 1-一月-2023 | 卷: | 14039 LNCS | 起(迄)頁: | 140 - 154 | 來源出版物: | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 摘要: | Two topic detection techniques—co-word network analysis and topic modeling—were applied to extract topics in the Ted Talks. Ted Talks was chosen for its enormous impact worldwide and the rich descriptive data accompanying each talk that allow us to compare the topics resulting from different methods. The co-word network was built based on the “related_tags” field so that modularity analysis can be performed to classify the tags according to their co-occurrence patterns. Topic modeling was applied to the description field and the full-text transcript separately to detect the topics present in the free-text. The results of network modularity analysis revealed 13 interpretable topics consisting of closely knitted tags. Topic modeling generated 25 topics for the description and 40 for the transcript, respectively. Our results showed that both topic extraction methods were able to successfully identify the range of topics in the TED Talks. While the co-word network gave a broad overview and afforded visualization, the topic model revealed topics with greater granularity. We compared the semantics of the topics produced by different methods and discussed the methodological implications of our research. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/636753 | ISBN: | 9783031360480 | ISSN: | 03029743 | DOI: | 10.1007/978-3-031-36049-7_11 |
顯示於: | 圖書資訊學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。