https://scholars.lib.ntu.edu.tw/handle/123456789/413119
標題: | Intent mining in search query logs for automatic search script generation | 作者: | Wang C.-J. Chen H.-H. |
關鍵字: | Intent mining;Query log analysis;Search script generation;Web search enhancement | 公開日期: | 2014 | 卷: | 39 | 期: | 3 | 起(迄)頁: | 513-542 | 來源出版物: | Knowledge and Information Systems | 摘要: | Capturing users' information needs is essential in decreasing the barriers in information access. This paper mines sequences of actions called search scripts from search query logs which keep large-scale users' search experiences. Search scripts can be applied to guide users to satisfy their information needs, improve the search effectiveness of retrieval systems, recommend advertisements at suitable places, and so on. Information quality, query ambiguity, topic diversity, and document relevancy are four major challenging issues in search script mining. In this paper, we determine the relevance of URLs for a query, adopt the Open Directory Project (ODP) categories to disambiguate queries and URLs, explore various features and clustering algorithms for intent clustering, identify critical actions from each intent cluster to form a search script, generate a nature language description for each action, and summarize a topic for each search script. Experiments show that the complete link hierarchical clustering algorithm with the features of query terms, relevant URLs, and disambiguated ODP categories performs the best. Applying the intent clusters created by the best model to intent boundary identification achieves an F score of 0.6666. The intent clusters then are applied to generate search scripts. ? 2013 Springer-Verlag London. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/413119 | ISSN: | 02191377 | DOI: | 10.1007/s10115-013-0620-3 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。