https://scholars.lib.ntu.edu.tw/handle/123456789/154897
Title: | Clustering over Multiple Evolving Streams by Events and Correlations | Authors: | Yeh, Mi-Yen Dai, Bi-Ru MING-SYAN CHEN |
Keywords: | Data clustering; Data mining; Data streams | Issue Date: | 2007 | Journal Volume: | 19 | Journal Issue: | 10 | Start page/Pages: | 1349-1362 | Source: | IEEE Transactions on Knowledge and Data Engineering | Abstract: | In applications of multiple data streams such as stock market trading and sensor network data analysis, the clusters of streams change at different times because of data evolution. The information about evolving cluster is valuable to support corresponding online decisions. In this paper, we present a framework for Clustering Over Multiple Evolving sTreams by CORrelations and Events, which, abbreviated as COMET-CORE, monitors the distribution of clusters over multiple data streams based on their correlation. Instead of directly clustering the multiple data streams periodically, COMET-CORE applies efficient cluster spilt and merge processes only when significant cluster evolution happens. Accordingly, we devise an event detection mechanism to signal the cluster adjustments. The coming streams are smoothed as sequences of end points by employing piecewise linear approximation. At the time when end points are generated, weighted correlations between streams are updated. End points are good indicators of significant change in streams, and this is a main cause of a cluster evolution event. When an event occurs, through split and merge operations we can report the latest clustering results. As shown in our experimental studies, COMET-CORE can be performed effectively with good clustering quality. © 2007 IEEE. |
URI: | http://ntur.lib.ntu.edu.tw//handle/246246/141981 http://ntur.lib.ntu.edu.tw/bitstream/246246/141981/1/55.pdf https://www.scopus.com/inward/record.uri?eid=2-s2.0-34648854456&doi=10.1109%2fTKDE.2007.1071&partnerID=40&md5=8bf7c8d4d12cd1e7b269b257d96daa56 |
ISSN: | 10414347 | DOI: | 10.1109/TKDE.2007.1071 | SDG/Keyword: | Correlation methods; Data mining; Data reduction; Decision making; Online systems; Sensor networks; Data clustering; Data evolution; Data streams; Stock markets; Cluster analysis |
Appears in Collections: | 電機工程學系 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.