https://scholars.lib.ntu.edu.tw/handle/123456789/489768
標題: | Discovering and explaining abnormal nodes in semantic graphs | 作者: | Lin, S.-D. Lin, Shou-de SHOU-DE LIN |
公開日期: | 2008 | 卷: | 20 | 期: | 8 | 起(迄)頁: | 1039-1052 | 來源出版物: | IEEE Transactions on Knowledge and Data Engineering | 摘要: | An important problem in the area of homeland security is to identify abnormal or suspicious entities in large data sets. Although there are methods from data mining and social network analysis focusing on finding patterns or central nodes from networks or numerical data sets, there has been little work aimed at discovering abnormal instances in large complex semantic graphs, whose nodes are richly connected with many different types of links. In this paper, we describe a novel unsupervised framework to identify such instances. Besides discovering abnormal instances, we believe that to complete the process, a system has to also provide users with understandable explanations for its findings. Therefore, in the second part of the paper, we describe an explanation mechanism to automatically generate human-understandable explanations for the discovered results. To evaluate our discovery and explanation systems, we perform experiments on several different semantic graphs. The results show that our discovery system outperforms state-of-the-art unsupervised network algorithms used to analyze the 9/11 terrorist network and other graph-based outlier detection algorithms by a significant margin. Additionally, the human study we conducted demonstrates that our explanation system, which provides natural language explanations for the system's findings, allowed human subjects to perform complex data analysis In a much more efficient and accurate manner. © 2008 IEEE. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/489768 | DOI: | 10.1109/TKDE.2007.190691 | SDG/關鍵字: | (e ,3e) process; Central nodes; Complex data; Different types; Explanation systems; Graph-based; Homeland security (HLS); Human subjects; Large data sets; Natural language explanations; Numerical data; Outlier Detection; Semantic graphs; Social network analysis (SNA); Unsupervised network; Administrative data processing; Arts computing; Decision support systems; Electric network analysis; Graph theory; Information management; Information theory; Knowledge management; Online searching; Search engines; Semantics; Set theory; Statistical methods; Security of data |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。