李秀惠臺灣大學:資訊工程學研究所陳建宏Chien-Hung, ChenChenChien-Hung2010-06-092018-07-052010-06-092018-07-052009U0001-2506200917191300http://ntur.lib.ntu.edu.tw//handle/246246/185433關鍵字擴展是增進資訊檢索效能的重要技術。依據擴展的文件集,可分為兩類,一類是基於內部文件集的關鍵字擴展,另一類則是基於網路文件集。 之前的研究顯示,基於內部文件集的擴展方法在增進檢索效能不佳的查詢上有瓶頸,然而,有人提出解決此問題之方法,是選用外部的文件集做為查詢擴展文件集,像是網路文件。於這兩類擴展方法的比較,我們的實驗結果顯示,的確在增進檢索效能不佳的查詢上,基於網路文件集-維基百科的關鍵字擴展有較好的效能,至於基於內部文件集而實行的關鍵字擴展,則在其他不同的查詢問題上有較好的表現。因此,我們提出將兩類不同文件集產生的擴展關鍵字列表作結合的方法,並評量其檢索效能的表現。大致上,我們擴展關鍵字的結合方法產生了較好的結果;但嚴謹的來說,我們只說我們的方式提供了平衡的結果。Query expansion is an important technique to improve search capability in information retrieval. According to expansion collection, there are two types of query expansion. One is query expansion performed on local documents and another is performed on web documents. The previous research found the method which is based on local documents has bottleneck on poorly performing topics, called hard topics. However, others propose to improve poorly performing topics is exploiting text collections other than the target collection such as the web. Regarding our comparison of these two types of query expansion, our result shows query expansion based on web resource, which is Wikipedia, indeed has better performance on hard topics. As for query expansion performed on local documents, it has better performance on other topics. Therefore, we propose a combined method to integrate two ranked lists of terms expanded by these two types of query expansion, and evaluate the corresponding search performance. Roughly speaking, our combined query expansion methods produce better performance. However, to view it in a strict way, our methods provide balanced results.中文摘要 vbstract vihaper 1 Introduction 1.1 Background 1.2 Motivation and Objective 3.3 Organization 3haper 2 Related Works 4.1 Query Expansion based on Local Documents 4.2 Query Expansion based on Web Documents 5.3 Combined Query Expansion Method 6haper 3 Research Approach 8.1 Framework 8.2 Query Expansion Method 9.2.1 Co-occurrence Statistics Method 10.2.2 Frequency Statistics Method 11.2.3 Combined Method 12.3 Combined Query Expansion Method 13.3.1 Score Normalization Methods 14.3.2 Expansion Term Merging Methods 16.4 Term weighting of Expanded Query Terms 17haper 4 Experiments 19.1 Experimental Environment 19.2 Blind Relevance Feedback(BRF)based on Local and Web Document 21.3 Combined Query Expansion Methods 25.4 Experiments Results Discussion 31haper 5 Conclusions and Future Works 33.1 Conclusions 33.2 Future Works 33eferences 35525876 bytesapplication/pdfen-US虛擬相關回饋關鍵字擴充維基百科Blind Relevance FeedbackQuery ExpansionWikipedia比較並結合基於內部和網路文件之查詢擴展方法Comparing and Combining Query Expansion Approaches based on Local and Web Documentsthesishttp://ntur.lib.ntu.edu.tw/bitstream/246246/185433/1/ntu-98-R96922046-1.pdf