Improving Cross-Blog Browsing Mechanism by Classification and Citation
Date Issued
2007
Date
2007
Author(s)
Tseng, Ru-Chi
DOI
en-US
Abstract
In this thesis, we utilize the citation links to cluster similar tags from different blogs together to provide a new way to assist cross-blog browsing.
Since blog systems could not communicate with each other, cross-blog searching and browsing is an issue to be solved. Tags defined in a blog indicate the aspects of communities the blogger belongs to. Thus, clustering similar tags together might help searching and browsing across blogs and distinguishing di?erent types of communities.
We transform the citation and content information to create graphs and experiment several graphical clustering methods. We also examine the traditional agglomerative hierarchical clustering methods using the information of content to have a thorough comparison.
The experiment result shows that clustering tags from blogs by the information of citation has roughly the same performance compared with clustering by the information of content in lower granularity and outperforms a little bit in higher granularity. However, it requires much more e?orts to process the data of content. Thus, citation link analysis is a light-weight and effective method to cluster tags and to assist cross-blog browsing.
Since blog systems could not communicate with each other, cross-blog searching and browsing is an issue to be solved. Tags defined in a blog indicate the aspects of communities the blogger belongs to. Thus, clustering similar tags together might help searching and browsing across blogs and distinguishing di?erent types of communities.
We transform the citation and content information to create graphs and experiment several graphical clustering methods. We also examine the traditional agglomerative hierarchical clustering methods using the information of content to have a thorough comparison.
The experiment result shows that clustering tags from blogs by the information of citation has roughly the same performance compared with clustering by the information of content in lower granularity and outperforms a little bit in higher granularity. However, it requires much more e?orts to process the data of content. Thus, citation link analysis is a light-weight and effective method to cluster tags and to assist cross-blog browsing.
Subjects
部落格
連結分析
標記分群
資料分群
Blog
Link Analysis
Tag Clustering
Data Clustering
Social Network Analysis
Type
other
File(s)![Thumbnail Image]()
Loading...
Name
ntu-96-R94725033-1.pdf
Size
23.31 KB
Format
Adobe PDF
Checksum
(MD5):15f403ee69d294f9e532af10404b343b