Journal of Computer Applications ›› 2014, Vol. 34 ›› Issue (2): 542-545.
• Artificial intelligence • Previous Articles Next Articles
LIU Mao,ZHANG Dongbo,ZHAO Yuanyuan
Received:
Revised:
Online:
Published:
Contact:
刘茂,张东波,赵圆圆
通讯作者:
作者简介:
基金资助:
Abstract: To solve the false detection and detection delay of concept drift for data stream, a new online concept drift detection method based on the distance measurement of overlapped data windows was proposed in this paper. By dividing the data stream into overlapped data windows and computing the heterogeneous Euclidean distance of neighboring windows, and measuring the inconsistency of the data windows through the nearest neighbor principle, the authors could achieve the evaluation of distribution diversity and the detection of concept drift. To evaluate the effectiveness of the proposed method, experiments were made on some public data sets with different drift severity and drift speed. The experimental results show that the proposed method can detect different types of concept drift quickly and accurately and can figure out the locations where concept drift appeared. Key words: concept drift; data stream; heterogeneous Euclidean distance; overlap data windows
Key words: concept drift, data stream, heterogeneous Euclidean distance, overlap data window
摘要: 针对数据流中的概念漂移检测存在错误检测、延迟检测等问题,提出了一种基于交叠数据窗距离测度的在线概念漂移检测方法。通过将数据流划分成大小相等且交叠的数据窗并计算相邻交叠数据窗异构欧氏距离,同时利用近邻原则判别数据窗中样本不一致程度,从而实现分布差异性评价和漂移的检测。为评价该方法的有效性,在具有不同漂移严重程度和漂移速度的公开数据集上进行了实验,实验结果表明:该方法能够准确快速地检测到不同类型的概念漂移且能够找出概念漂移发生的具体位置。
关键词: 概念漂移, 数据流, 异构欧氏距离, 交叠数据窗
CLC Number:
TP18 人工智能理论
LIU Mao ZHANG Dongbo ZHAO Yuanyuan. Concept drift detection based on distance measurement of overlapped data windows[J]. Journal of Computer Applications, 2014, 34(2): 542-545.
刘茂 张东波 赵圆圆. 基于交叠数据窗距离测度概念漂移检测新方法[J]. 计算机应用, 2014, 34(2): 542-545.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/
https://www.joca.cn/EN/Y2014/V34/I2/542