Transductive learning method applied to ensemble classification over data streams

Journal of Computer Applications (计算机应用), 2009, Vol. 29, Issue 06: 1578-1581.

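Diao Shumin (刁树民)¹, Wang Yongli (王永利)²

1. Public Computer Teaching and Research Department, Jiamusi University
2. Public Computer Teaching and Research Department, Jiamusi University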

Received: 2008-12-19  Revised: 2009-03-05  Online: 2009-06-10  Published: 2009-06-01
Corresponding author: Diao Shumin (刁树民)
Funding: Other; university-level fund

Abstract

Abstract: When combining decisions, existing ensemble classification methods require a common set of labeled training samples that is valid for all of the component classifiers. To address ensemble classification over data streams when no labeled samples are available, a fusion strategy for data-stream ensemble classifiers based on constraint-based learning is proposed. When choosing decisions on the test samples, the method follows transductive learning theory and is designed to satisfy the constraints measured by each local classifier. This guarantees that the constraints are feasible and solves the problem of the transductive extension of maximum entropy for aggregation in distributed classification. Experiments on test data sets show that, compared with the existing transductive learning method, the proposed method achieves higher decision accuracy and can be applied to the fusion of ensemble classifiers over data streams.

Keywords: data streams, constraint-based learning, transductive learning, maximum entropy, distributed ensemble classification

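Diao Shumin, Wang Yongli. Transductive learning method applied to ensemble classification over data streams[J]. Journal of Computer Applications, 2009, 29(06): 1578-1581.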
