Mining Significant Subspaces

ANTHONY  J. T. LEE; Ming-Chih Lin; Yun-Ru Wang; Kuo-Tay Chen; Kuo-Tay Chen;ANTHONY  J. T. LEE;Yun-Ru Wang;Ming-Chih Lin

doi:10.6382/JIM.201012.0027

Mining Significant Subspaces

Other Title

重要子空間之資料探勘

Journal

資訊管理學報

Journal Volume

17卷專刊

Pages

27-49

Date Issued

2010-12

Author(s)

ANTHONY J. T. LEE

Ming-Chih Lin

Yun-Ru Wang

Kuo-Tay Chen

DOI

10.6382/JIM.201012.0027

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/415139

Abstract

As both the number of dimensions increases, existing clustering methods in full feature space are not appropriate to cluster data in databases. Thus, the subspace clustering has attracted more and more attention recently. In this paper, we propose a novel method to mine significant subspaces from all frequent subspaces, where a subspace is frequent if it contains enough data points. The proposed method consists of three phases. First, we generate all frequent 2-dimensional subspaces. Second, we recursively combine frequent k-dimensional subspaces to generate frequent (k+1)-dimensional subspaces, k≥2. Finally, we adopt a greedy algorithm to summarize the frequent subspaces generated and select the significant ones. The experimental results show that the proposed method has better quality and coverage than DUSC, and better quality than FIRES.

Type

journal article

Mining Significant Subspaces

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)