基于概要数据结构的全网络持续流检测方法

doi:10.11772/j.issn.1001-9081.2019010203

计算机应用 ›› 2019, Vol. 39 ›› Issue (8): 2354-2358.DOI: 10.11772/j.issn.1001-9081.2019010203

基于概要数据结构的全网络持续流检测方法

周爱平^1,2, 朱琛刚³

1. 泰州学院计算机科学与技术学院, 江苏泰州 225300;
2. 计算机网络和信息集成教育部重点实验室(东南大学), 南京 211189;
3. 东南大学计算机科学与技术学院, 南京 211189

收稿日期:2019-02-13 修回日期:2019-03-19 发布日期:2019-08-14 出版日期:2019-08-10
通讯作者: 周爱平
作者简介:周爱平(1982-),男,江苏泰州人,讲师,博士,CCF会员,主要研究方向:网络测量、数据挖掘;朱琛刚(1982-),男,江苏南京人,博士研究生,主要研究方向:大数据分析、网络测量。
基金资助:
国家自然科学基金资助项目（61802274）；计算机网络和信息集成教育部重点实验室（东南大学）开放课题资助项目（K93-9-2017-01）；泰州市科研启动基金资助项目（QD2016027）。

Detection method for network-wide persistent flow based on sketch data structure

ZHOU Aiping^1,2, ZHU Chengang³

1. School of Computer Science and Technology, Taizhou University, Taizhou Jiangsu 225300, China;
2. Key Laboratory of Computer Network and Information Integration of Ministry of Education(Southeast University), Nanjing Jiangsu 211189, China;
3. School of Computer Science and Engineering, Southeast University, Nanjing Jiangsu 211189, China

Received:2019-02-13 Revised:2019-03-19 Online:2019-08-14 Published:2019-08-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61802274), the Open Project Foundation of Key Laboratory of Computer Network and Information Integration of Ministry of Education (Southeast University) (K93-9-2017-01), the Scientific Research Foundation for Advanced Talents of Taizhou (QD2016027).

摘要/Abstract

摘要： 持续流是隐蔽的网络攻击过程中显现的一种重要特征，它不产生大量流量且在较长周期内有规律地发生，给传统的检测方法带来极大挑战。针对网络攻击的隐蔽性、单监测点的重负荷和信息有限的问题，提出全网络持续流检测方法。首先，设计一种概要数据结构，并将其部署在每个监测点；其次，当网络流到达监测点时，提取流的概要信息并更新概要数据结构的一位；然后，在测量周期结束时，主监测点将来自其他监测点的概要信息进行综合；最后，提出流持续性的近似估计，通过一些简单计算为每个流构建一个位向量，利用概率统计方法估计流持续性，使用修正后的持续性估计检测持续流。通过真实的网络流量进行实验，结果表明，与长持续时间流检测算法（TLF）相比，所提方法的准确性提高了50%，误报率和漏报率分别降低了22%和20%，说明全网络持续流检测方法能够有效监测高速网络流量。

关键词: 网络测量, 持续流检测, 网络攻击, 概要数据结构, 概率统计方法

Abstract: Persistent flow is an important feature of hidden network attack. It does not generate a large amount of traffic and it occurs regularly in a long period, so that it brings a large challenge for traditional detection methods. Network attacks have invisibility, single monitors have heavy load and limited information. Aiming at the above problems, a method to detect network-wide persistent flows was proposed. Firstly, a sketch data structure was designed and was deployed on each monitor. Secondly, when the network flow arrived at a monitor, the summary information was extracted from network data stream and one bit in the sketch data structure was updated. Thirdly, at the end of measurement period, the summary information from other monitors was synthesized by the main monitor. Finally, the approximate estimation of flow persistence was presented. A bit vector was constructed for each flow by some simple computing, flow persistence was estimated by using probability statistical method, and the persistent flows were detected based on revised persistence estimation. The experiments were conducted on real network traffic, and their results show that compared with the algorithm of Tracing Long Duration flows (TLF), the proposed method increases the accuracy by 50% and reduces the false positive rate, false negative rate by 22%, 20% respectively. The results illustrate that the method of detecting network-wide persistent flows can effectively monitor network traffic in high-speed networks.

Key words: network measurement, persistent flow detection, network attack, sketch data structure, probabilistic statistical method

中图分类号:

TP393.08

周爱平, 朱琛刚. 基于概要数据结构的全网络持续流检测方法[J]. 计算机应用, 2019, 39(8): 2354-2358.

ZHOU Aiping, ZHU Chengang. Detection method for network-wide persistent flow based on sketch data structure[J]. Journal of Computer Applications, 2019, 39(8): 2354-2358.

参考文献

[1] 赵小欢,夏靖波,付凯,等.高速网络流频繁项挖掘算法[J].计算机研究与发展,2014,51(11):2458-2469. (ZHAO X H, XIA J B, FU K, et al. Frequent items mining algorithm over network flows at high-speed network[J]. Journal of Computer Research and Development, 2014, 51(11):2458-2469.)
[2] LIU W, QU W, GONG J, et al. Detection of superpoints using a vector bloom filter[J]. IEEE Transactions on Information Forensics and Security, 2016, 11(3):514-527.
[3] CHEN A, JIN Y, CAO J, et al. Tracking long duration flows in network traffic[C]//Proceedings of the 2010 International Conference on Information Communications. Piscataway, NJ:IEEE, 2010:206-210.
[4] LEE S, SHIN S, YOON M. Detecting long duration flows without false negatives[J]. IEICE Transactions on Communications, 2011, 94(5):1460-1462.
[5] GIROIRE F, CHANDRASHEKAR J, TAFT N, et al. Exploiting temporal persistence to detect covert botnet channels[C]//Proceedings of the 2009 International Workshop on Recent Advances in Intrusion Detection, LNCS 5758. Berlin:Springer, 2009:326-345.
[6] HEULE S, NUNKESSER M, HALL A. HyperLogLog in practice:algorithmic engineering of a state of the art cardinality estimation algorithm[C]//Proceedings of the 16th International Conference on Extending Database Technology. New York:ACM, 2013:683-692.
[7] ZHOU Y, ZHOU Y, CHEN M, et al. Persistent spread measurement for big network data based on register intersection[J]. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2017, 1(1):No.15.
[8] XIAO Q, QIAO Y, ZHEN M, et al. Estimating the persistent spreads in high-speed networks[C]//Proceedings of the 22nd IEEE International Conference on Network Protocols. Piscataway:IEEE, 2014:131-142.
[9] SHIN S, YOON M. Virtual vectors and network traffic analysis[J]. IEEE Network, 2012, 26(1):22-26.
[10] LAHIRI B, CHANDRASHEKAR J, TIRTHAPURA S. Space-efficient tracking of persistent items in a massive data stream[C]//Proceedings of the 5th International Conference on Distributed Event-based System. New York:ACM, 2011:255-266.
[11] SINGH S A, TIRTHAPURA S. Monitoring persistent items in the union of distributed streams[J]. Journal of Parallel and Distributed Computing, 2014, 74(11):3115-3127.
[12] DAI H, SHAHZAD M, LIU A X, et al. Finding persistent items in data streams[J]. Proceedings of the VLDB Endowment, 2016, 10(4):289-300.
[13] ESTAN C, VARGHESE G, FISK M. Bitmap algorithms for counting active flows on high-speed links[J]. IEEE/ACM Transactions on Networking, 2006, 14(5):925-937.
[14] KUMAR A, XU J, WANG J. Space-code bloom filter for efficient per-flow traffic measurement[J]. IEEE Journal on Selected Areas in Communications, 2006, 24(12):2327-2339.
[15] CORMODE G, MUTHUKRISHNAN S. An improved data stream summary:the count-min sketch and its applications[J]. Journal of Algorithms, 2005, 55(1):58-75.
[16] FLAJOLET P, FUSY È, GANDOUET O, et al. HyperLogLog:the analysis of a near-optimal cardinality estimation algorithm[J]. Discrete Mathematics and Theoretical Computer Science, 2007, 28(3):127-146.
[17] HUANG Q, LEE P P C. A hybrid local and distributed sketching design for accurate scalable heavy key detection in network data streams[J]. Computer Networks:The International Journal of Computer and Telecommunications Networking, 2015, 91(C):298-315.
[18] ANTUNES N, PIPIRAS V. Estimation of flow distributions from sampled traffic[J]. ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 2016, 1(3):No.17.
[19] IP Trace And Service[DS/OL].[2018-11-20]. http://iptas.edu.cn/src/system.php.

基于概要数据结构的全网络持续流检测方法

Detection method for network-wide persistent flow based on sketch data structure

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	吴中岱, 韩德志, 蒋海豹, 冯程, 韩冰, 陈重庆. 海洋船舶通信网络安全综述[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2123-2136.
[2]	王云燕, 胡爱花. 网络攻击下双层结构多智能体系统一致性[J]. 计算机应用, 2021, 41(5): 1399-1405.
[3]	吴若豪, 董平, 郑涛. 基于OpenDayLight的恶意扫描防护技术[J]. 计算机应用, 2018, 38(1): 188-193.
[4]	孙文君, 苏旸, 曹镇. 非对称信息条件下APT攻防博弈模型[J]. 计算机应用, 2017, 37(9): 2557-2562.
[5]	万智萍. 基于改进Das协议的无线传感器网络用户认证协议UAPL[J]. 计算机应用, 2014, 34(2): 452-455.
[6]	姜志宏王晖黄兵李沛樊鹏翼. P2P TV在线用户的时空分布研究[J]. 计算机应用, 2012, 32(07): 2022-2026.
[7]	姜志宏王晖樊鹏翼袁雪美. 一个P2P IPTV多协议爬行器——TVCrawler[J]. 计算机应用, 2010, 30(3): 715-718.
[8]	胡治国张大陆侯翠平张俊生. 自适应网络往返时延采样方法[J]. 计算机应用, 2010, 30(2): 319-322.
[9]	张大陆张俊生胡治国朱小庆. 基于子路径可用带宽测量的紧链路定位方法[J]. 计算机应用, 2010, 30(12): 3141-3144.
[10]	刘方正祁建清司贵生. 非均匀随机扫描的蠕虫离散传播模型[J]. 计算机应用, 2010, 30(10): 2677-2678.
[11]	黄光球李艳. 基于粗糙图的网络风险评估模型[J]. 计算机应用, 2010, 30(1): 190-195.
[12]	李欢高岭刘琳邢斌. 基于随机顺序的图形验证码改进算法设计[J]. 计算机应用, 2010, 30(06): 1501-1504.
[13]	肖丹杨英杰施敏建. 一种基于协同机制的攻击源追踪方法[J]. 计算机应用, 2007, 27(4): 854-856.
[14]	宇佳赵保华 . 一种可扩展的分布式网络攻击测试系统[J]. 计算机应用, 2006, 26(9): 2140-2144.
[15]	李树军 . 基于协议转变的拒绝服务攻击技术的研究[J]. 计算机应用, 2006, 26(10): 2323-2325.