https://scholars.lib.ntu.edu.tw/handle/123456789/413158
標題: | Collaborative cyberporn filtering with collective intelligence | 作者: | Lee L.-H. HSIN-HSI CHEN |
關鍵字: | Pornographic blacklists;Query log analysis;Searches-and-clicks | 公開日期: | 2011 | 起(迄)頁: | 1153-1154 | 來源出版物: | 34th International ACM SIGIR Conference on Research and Development in Information Retrieval | 摘要: | This paper presents a user intent method to generate blacklists for collaborative cyberporn filtering. A novel porn detection framework that finds new pornographic web pages by mining user search behaviors is proposed. It employs users' clicks in search query logs to select the suspected web pages without extra human efforts to label data for training, and determines their categories with the help of URL host name and path information, but without web page content. We adopt an MSN porn data set to explore the effectiveness of our method. This user intent approach achieves high precision, while maintaining favorably low false positive rate. In addition, real-life filtering simulation reveals that our user intent method with its accumulative update strategy achieves 43.36% of blocking rate, while maintaining a steadily less than 7% of over-blocking rate. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/413158 | ISBN: | 9781450309349 | DOI: | 10.1145/2009916.2010095 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。