Discovering and explaining abnormal nodes in semantic graphs

Lin, Shou-de; Chalupsky, H.

doi:10.1109/TKDE.2007.190691

Journal

IEEE Transactions on Knowledge and Data Engineering

Journal Volume

20

Journal Issue

8

Pages

1039-1052

Date Issued

2008

Author(s)

Lin, Shou-de

DOI

10.1109/TKDE.2007.190691

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/489768

URL

https://www.scopus.com/inward/record.uri?eid=2-s2.0-46649095259&doi=10.1109%2fTKDE.2007.190691&partnerID=40&md5=adecd7c78b5fe4f4ea95af34b6e83249

Abstract

An important problem in the area of homeland security is to identify abnormal or suspicious entities in large data sets. Although there are methods from data mining and social network analysis focusing on finding patterns or central nodes from networks or numerical data sets, there has been little work aimed at discovering abnormal instances in large complex semantic graphs, whose nodes are richly connected with many different types of links. In this paper, we describe a novel unsupervised framework to identify such instances. Besides discovering abnormal instances, we believe that to complete the process, a system has to also provide users with understandable explanations for its findings. Therefore, in the second part of the paper, we describe an explanation mechanism to automatically generate human-understandable explanations for the discovered results. To evaluate our discovery and explanation systems, we perform experiments on several different semantic graphs. The results show that our discovery system outperforms state-of-the-art unsupervised network algorithms used to analyze the 9/11 terrorist network and other graph-based outlier detection algorithms by a significant margin. Additionally, the human study we conducted demonstrates that our explanation system, which provides natural language explanations for the system's findings, allowed human subjects to perform complex data analysis in a much more efficient and accurate manner. © 2008 IEEE.

SDGs

SDG16

Other Subjects

(e ,3e) process; Central nodes; Complex data; Different types; Explanation systems; Graph-based; Homeland security (HLS); Human subjects; Large data sets; Natural language explanations; Numerical data; Outlier Detection; Semantic graphs; Social network analysis (SNA); Unsupervised network; Administrative data processing; Arts computing; Decision support systems; Electric network analysis; Graph theory; Information management; Information theory; Knowledge management; Online searching; Search engines; Semantics; Set theory; Statistical methods; Security of data

Type

journal article
