Application of self-training based on ensemble learning in intrusion detection

Journal of Computer Applications ›› 2010, Vol. 30 ›› Issue (3): 695-698.


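CHENG Zhonghan (程仲汉)¹, ZANG Lie (臧洌)²

1. College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics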
2.

Received: 2009-09-06 Revised: 2009-10-22 Online: 2010-03-14 Published: 2010-03-01
Corresponding author: CHENG Zhonghan


Abstract

Abstract: Labeled data for intrusion detection are difficult to obtain. To address this problem, the paper proposes regularized self-training, a self-training method based on ensemble learning. The method combines active learning with regularization theory and uses unlabeled data to further improve an existing classifier that has already learned the classification patterns well. Comparative experiments were run on three main ensemble learning methods under different proportions of labeled data. The results show that large amounts of unlabeled data can improve the classification boundary of the combined classifier, and that the algorithm significantly reduces the error rate of the resulting classifier.

Key words: semi-supervised learning, ensemble learning, intrusion detection
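The general idea of self-training with an ensemble can be illustrated with a minimal sketch. This is not the authors' regularized self-training: the abstract does not give the regularization term or the active learning step, so both are omitted here. Everything in the block is an assumption made for illustration: the nearest-centroid base learner, the stratified bootstrap used for bagging, the `min_agree` vote threshold, the toy two-dimensional data, and all function names.

```python
import random
from collections import Counter

def centroid(points):
    """Mean vector of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))

def fit_bagging(X, y, n_models, rng):
    """Train n_models nearest-centroid learners on stratified bootstrap resamples."""
    by_class = {}
    for x, label in zip(X, y):
        by_class.setdefault(label, []).append(x)
    models = []
    for _ in range(n_models):
        # Resample within each class so every model sees every class.
        models.append({label: centroid([rng.choice(pts) for _ in pts])
                       for label, pts in by_class.items()})
    return models

def predict_one(model, x):
    """Label of the closest class centroid (squared Euclidean distance)."""
    return min(model, key=lambda c: sum((a - b) ** 2 for a, b in zip(model[c], x)))

def vote(models, x):
    """Majority label and the fraction of ensemble members that agree with it."""
    counts = Counter(predict_one(m, x) for m in models)
    label, n = counts.most_common(1)[0]
    return label, n / len(models)

def self_train(X_lab, y_lab, X_unl, n_models=15, min_agree=0.9, max_rounds=10, seed=0):
    """Repeatedly pseudo-label the unlabeled points the ensemble agrees on, then retrain."""
    rng = random.Random(seed)
    X_lab, y_lab, pool = list(X_lab), list(y_lab), list(X_unl)
    models = fit_bagging(X_lab, y_lab, n_models, rng)
    for _ in range(max_rounds):
        confident, rest = [], []
        for x in pool:
            label, agree = vote(models, x)
            (confident if agree >= min_agree else rest).append((x, label))
        if not confident:
            break  # nothing new to learn from
        for x, label in confident:
            X_lab.append(x)
            y_lab.append(label)
        pool = [x for x, _ in rest]
        models = fit_bagging(X_lab, y_lab, n_models, rng)
    return models, pool

# Toy data: label 0 stands for normal traffic, label 1 for an attack.
normal = [(0.1 * i, 0.1 * (i % 7)) for i in range(30)]
attack = [(5 + 0.1 * i, 5 + 0.1 * (i % 7)) for i in range(30)]
X_lab, y_lab = normal[:3] + attack[:3], [0] * 3 + [1] * 3
X_unl = normal[3:] + attack[3:]

models, pool = self_train(X_lab, y_lab, X_unl)
print("still unlabeled:", len(pool))
```

The vote fraction serves as a simple confidence measure: only points on which most ensemble members agree receive a pseudo-label, which limits the risk of reinforcing early mistakes. On real intrusion data the classes overlap, so the threshold and the base learner would need tuning.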

CHENG Zhonghan, ZANG Lie. Application of self-training based on ensemble learning in intrusion detection[J]. Journal of Computer Applications, 2010, 30(3): 695-698.
