https://scholars.lib.ntu.edu.tw/handle/123456789/413149
標題: | Search scripts mining from wisdom of the crowds | 作者: | Wang C.-J. HSIN-HSI CHEN |
關鍵字: | mining web logs;search script;web search enhancement | 公開日期: | 2011 | 起(迄)頁: | 878-883 | 來源出版物: | IEEE International Conference on Systems, Man and Cybernetics | 摘要: | This paper mines sequences of actions called search scripts from query logs which keep large scale users' search experiences. Search scripts can be applied to predict users' search needs, improve the retrieval effectiveness, recommend advertisements, and so on. Information quality, topic diversity, query ambiguity, and URL relevancy are major challenging issues in search scripts mining. In this paper, we calculate the relevance of URLs, adopt the Open Directory Project (ODP) categories to disambiguate queries and URLs, explore various features and clustering algorithms for intent clustering, and identify critical actions from each intent cluster to form a search script. Experiments show that the model based on a complete link hierarchical clustering algorithm with the features of query terms, relevant URLs, and disambiguated ODP categories performs the best. Search scripts are generated from the best model. When only search scripts containing a single intent are considered to be correct, the accuracy of the action identification algorithm is 0.4650. If search scripts containing a major intent are also counted, the accuracy increases to 0.7315. ? 2011 IEEE. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/413149 | ISBN: | 9781457706523 | ISSN: | 1062922X | DOI: | 10.1109/ICSMC.2011.6083762 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。