• •    

基于告警日志的网络故障预测研究

钟将1,时待吾1,王振华2   

  1. 1. 重庆市重庆大学计算机学院
    2. 华为技术有限公司
  • 收稿日期:2015-09-08 修回日期:2015-12-30 发布日期:2015-12-30
  • 通讯作者: 时待吾

Study on Network Failure Prediction Based on Alarm Log

  • Received:2015-09-08 Revised:2015-12-30 Online:2015-12-30
  • Contact: Dai-Wu SHI

摘要: 摘 要: 为避免因为网络故障导致难以预料的损失,在某些应用场景下需要预先评估网络中可能出现的故障。本文收集整理某城域网络14个月的网络告警日志作为网络故障预测研究的数据集并提出一种基于告警日志的网络故障预测研究方法:首先以本文提出的基于两级时间窗口的特征提取方法构建特征表征网络运行状态,并通过大量实验来选择构建特征所需的最佳参数组合,然后设计并实现了一种基于分类学习方法的自适应故障预测模型。大量的数据实验表明:对于整个网络未来6小时是否出现故障的预测准确率可以达到70%以上,明显好于基于威布尔分布的预测模型;在对网络设备故障进行预测时,分类预测的结果仍然优于基于威布尔分布的预测模型。论文初步研究结果表明,网络中大部分故障可通过网络运行日志数据进行预测,从而证明论文提出的方法具有较好的预测效果,能够在实际应用中及早发现故障,降低经济损失。

关键词: 关键词: 网络故障, 网络设备, 故障预测, 分类预测, 威布尔分布

Abstract: Abstract: To avoid unpredictable losses because of network failure, the reliability of the network needs to be evaluated in some application scenarios. This paper start the network failure prediction research upon 14 months’ network alarm logs we collected. The logs are of one Metropolitan area network. The research method is shown as below: firstly, construct features to represent network characteristics by the means of the feature construction method which is based on two levels time windows; secondly, select optimal parameter combination to create the feature files through multiple experiments; thirdly, design and build adaptive failure prediction model according to classification learning methods. Numbers of experiments show that accuracy of predicting whether the network failure takes place in 6 hours is up to 70%, is better than the prediction result of Weibull distribution model obviously; the results of classification prediction for network equipment failure are slightly better than the prediction method on the basis of Weibull distribution. Preliminary research results show that most network failures can be predicted through analyzing previous network running logs and the method proposed in this paper is verified to be with good prediction effect. This method can detect failures in practical application on early stage and reduce unnecessary economic losses.

Key words: Keywords: network failure, network equipment, failure prediction, classification prediction, Weibull distribution

中图分类号: