计算机应用 ›› 2020, Vol. 40 ›› Issue (8): 2241-2247.DOI: 10.11772/j.issn.1001-9081.2020010010

• 人工智能 • 上一篇    下一篇

基于深度特征和Seq2Seq模型的网络态势预测方法

林志兴1,2, 王立可3   

  1. 1. 三明学院 网络中心, 福建 三明 365004;
    2. 福建师范大学 数学与信息学院, 福州 350007;
    3. 中国科学院 成都计算机应用研究所, 成都 610041
  • 收稿日期:2020-01-03 修回日期:2020-04-12 出版日期:2020-08-10 发布日期:2020-04-16
  • 通讯作者: 王立可(1993-),男,山东临沂人,硕士研究生,主要研究方向:自然语言处理、深度学习。964560581@qq.com
  • 作者简介:林志兴(1973-),男,福建尤溪人,高级实验师,硕士,主要研究方向:信息化、网络安全。
  • 基金资助:
    国家自然科学基金资助项目(61771140);2018年福建省科技厅自然科学基金资助项目(2018J01560);2017年福建省中青年教师教育科研项目(JAT170552);四川省科技计划项目(2018GZDZX0041,2019ZDZX0005,2019ZDZX0006)。

Network situation prediction method based on deep feature and Seq2Seq model

LIN Zhixing1,2, WANG Like3   

  1. 1. Network Center, Sanming University, Sanming Fujian 365004, China;
    2. College of Mathematics and Informatics, Fujian Normal University, Fuzhou Fujian 350007, China;
    3. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu Sichuan 610041, China
  • Received:2020-01-03 Revised:2020-04-12 Online:2020-08-10 Published:2020-04-16
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61771140), the Natural Science Foundation of Science and Technology Department of Fujian Province in 2018 (2018J01560), the Education and Scientific Research Project of Young and Middle-Aged Teachers in Fujian Province in 2017 (JAT170552), the Sichuan Science and Technology Program (2018GZDZX0041, 2019ZDZX0005, 2019ZDZX0006).

摘要: 针对目前大多数的网络态势预测方法不能挖掘数据中的深度信息且需要手动提取与构造特征的问题,提出了深度特征网络态势预测方法DFS-Seq2Seq。首先将网络流、日志和系统事件等产生的数据进行清洗处理,使用深度特征融合算法自动合成深度关系特征,然后采用自动编码器对合成的特征进行提取,最后使用长短期记忆网络(LSTM)构建Seq2Seq模型对数据进行预测。通过设计缜密的实验在公开数据集Kent2016上对所提方法进行验证,结果显示在深度为2时与支持向量机(SVM)、贝叶斯、随机森林(RF)和LSTM这四种分类模型相比,其召回率分别提升了7.4%、11.5%、6.5%、3.0%。实验结果表明DFS-Seq2Seq可以在实际应用中有效地识别网络身份验证中的危险事件,对网络态势作出有效的预测。

关键词: 网络态势, 深度特征合成, 自动编码器, Seq2Seq模型, 双向长短期记忆网络

Abstract: In view of the problem that most existing network situation prediction methods are unable to mine the deep information in the data and need to manually extract and construct features, a deep feature network situation prediction method named DFS-Seq2Seq (Deep Feature Synthesis-Sequence to Sequence) was proposed. First, the data produced by network streams, logs and system events were cleaned, and the deep feature synthesis algorithm was used to automatically synthesize the deep relation features. Then the synthesized features were extracted by the AutoEncoder (AE). Finally, the data was estimated by using the Seq2Seq (Sequence to Sequence) model constructed by Long Short-Term Memory (LSTM). Through a well-designed experiment, the proposed method was verified on the public dataset Kent2016. Experimental results show that when the depth is 2, compared with four classification models including Support Vector Machine (SVM), Bayes, Random Forest (RF) and LSTM, the proposed method has the recall rate increased by 7.4%, 11.5%, 6.5% and 3.0%, respectively. It is verified that DFS-Seq2Seq can effectively identify dangerous events in network authentication and effectively predict network situation in practice.

Key words: network situation, deep feature synthesis, AutoEncoder (AE), Seq2Seq (Sequence to Sequence) model, Bi-directional Long Short-Term Memory (Bi-LSTM) network

中图分类号: