https://scholars.lib.ntu.edu.tw/handle/123456789/581102
標題: | Concept drift detection based on pre-clustering and statistical testing | 作者: | Wan J.S.-W SHENG-DE WANG |
關鍵字: | Concept drift; Drift detection; Stream data mining; Unsupervised | 公開日期: | 2021 | 卷: | 22 | 期: | 2 | 起(迄)頁: | 465-472 | 來源出版物: | Journal of Internet Technology | 摘要: | Stream data processing has become an important issue in the last decade. Data streams are generated on the fly and possibly change their data distribution over time. Data stream processing requires some mechanisms or methods to adapt to the changes of data distribution, which is called the concept drift. Concept drift detection can be challenging due to the data labels are not known. In this paper, we propose a drift detection method based on the statistical test with clustering and feature extraction as preprocessing. The goal is to reduce the detection time with principal component analysis (PCA) for the feature extraction method. Experimental results on synthetic and real-world streaming data show that the clustering preprocessing improve the performance of the drift detection and feature extraction trade-off an insignificant performance of detection for speedup for the execution time. ? 2021 Taiwan Academic Network Management Committee. All rights reserved. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103662554&doi=10.3966%2f160792642021032202020&partnerID=40&md5=68f3144959839acaf365bf1a26654bc0 https://scholars.lib.ntu.edu.tw/handle/123456789/581102 |
ISSN: | 16079264 | DOI: | 10.3966/160792642021032202020 | SDG/關鍵字: | Data streams; Economic and social effects; Extraction; Statistical tests; Concept drifts; Data distribution; Data stream processing; Detection methods; Detection time; Feature extraction methods; Statistical testing; Stream data processing; Feature extraction |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。