Wang C.-J.HSIN-HSI CHEN2019-07-102019-07-10201197814577065231062922Xhttps://scholars.lib.ntu.edu.tw/handle/123456789/413149This paper mines sequences of actions called search scripts from query logs which keep large scale users' search experiences. Search scripts can be applied to predict users' search needs, improve the retrieval effectiveness, recommend advertisements, and so on. Information quality, topic diversity, query ambiguity, and URL relevancy are major challenging issues in search scripts mining. In this paper, we calculate the relevance of URLs, adopt the Open Directory Project (ODP) categories to disambiguate queries and URLs, explore various features and clustering algorithms for intent clustering, and identify critical actions from each intent cluster to form a search script. Experiments show that the model based on a complete link hierarchical clustering algorithm with the features of query terms, relevant URLs, and disambiguated ODP categories performs the best. Search scripts are generated from the best model. When only search scripts containing a single intent are considered to be correct, the accuracy of the action identification algorithm is 0.4650. If search scripts containing a major intent are also counted, the accuracy increases to 0.7315. ? 2011 IEEE.mining web logssearch scriptweb search enhancementSearch scripts mining from wisdom of the crowdsconference paper10.1109/ICSMC.2011.60837622-s2.0-83755178603https://www.scopus.com/inward/record.uri?eid=2-s2.0-83755178603&doi=10.1109%2fICSMC.2011.6083762&partnerID=40&md5=70d8bef059353bdcb50e06549e8c0b00