计算机应用 ›› 2018, Vol. 38 ›› Issue (10): 2881-2885.DOI: 10.11772/j.issn.1001-9081.2018040879

• 数据科学与技术 • 上一篇    下一篇

面向电信行业网络告警系统的告警过滤算法

徐冰珂1, 周宇喆1, 杨茂林1, 谢远航2, 李晓瑜1, 雷航1   

  1. 1. 电子科技大学 信息与软件工程学院, 成都 610054;
    2. 北京思特奇信息技术股份有限公司, 北京 100046
  • 收稿日期:2018-04-28 修回日期:2018-06-11 出版日期:2018-10-10 发布日期:2018-10-13
  • 通讯作者: 杨茂林
  • 作者简介:徐冰珂(1997-),女,四川乐山人,主要研究方向:数据分析;周宇喆(1997-),女,四川遂宁人,主要研究方向:自然语言处理、数据分析;杨茂林(1987-),男,四川宜宾人,助理研究员,博士,CCF会员,主要研究方向:数据分析、实时计算;谢远航(1990-),男,四川南充人,工程师,主要研究方向:电信运维监控系统、大数据分析;李晓瑜(1984-),女,山东菏泽人,副教授,博士,CCF会员,主要研究方向:量子计算、机器学习、大数据分析;雷航(1960-),男,四川自贡人,教授,博士,主要研究方向:人工智能、大数据分析、计算机系统。
  • 基金资助:
    国家自然科学基金资助项目(61502082)。

Alarm-filtering algorithm of alarm management system for telecom networks

XU Bingke1, ZHOU Yuzhe1, YANG Maolin1, XIE Yuanhang2, LI Xiaoyu1, LEI Hang1   

  1. 1. School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu Sichuan 610054, China;
    2. SI-TECH Information Technology Company Limited, Beijing 100046, China
  • Received:2018-04-28 Revised:2018-06-11 Online:2018-10-10 Published:2018-10-13
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61502082).

摘要: 为了减小电信网络中海量告警数据对的告警根源分析所造成的干扰,提出一种告警过滤算法。首先,基于电信网络告警数据对告警量分布、告警持续时间等特征进行量化分析,提出告警影响力和高频瞬态告警概念。在此基础上,从告警量、平均告警时间、告警影响力以及告警实例持续时间四个维度对告警重要程度进行综合分析,并提出复杂度为On)(n为告警记录数)的告警过滤算法。单因素实验分析显示,告警压缩比与特定告警元的告警量、平均告警时间、告警影响力以及告警实例持续时间具有正相关关系。对比实验结果表明,在相似告警压缩比下,所提算法的准确性比FTD(Flexible Transient flapping Determination)算法最多提高18个百分点,可用于电信行业的告警数据样本分析以及在线告警过滤。

关键词: 故障诊断, 告警过滤, 告警关联, 电信网络, 数据挖掘

Abstract: A large amount of alarms considerably complicate the root-cause analysis in telecom networks, thus a new alarm filtering algorithm was proposed to minimize the interference on the analysis. Firstly, a quantitative analysis for the alarm data, e.g., the quantity distribution and the average duration, was conducted, and the concepts of alarm impact and high-frequency transient alarm were defined. Subsequently, the importance of each alarm instance was evaluated from four perspectives:the amount of the alarms, the average duration of the alarms, the alarm impact, and the average duration of the alarm instance. Accordingly, an alarm filtering algorithm with O (n) computation complexity in principle was proposed, where n is the number of alarms under analysis. Single-factor experimental analysis show that the compression ratio of the alarm data has a positive correlation with the alarm amount of a specific alarm element, the average duration of the alarms, the alarm impact, and the duration of the alarm instance; further, the accuracy of the proposed algorithm is improved by 18 percentage points at most compared with Flexible Transient Flapping Determination (FTD) algorithm. The proposed algorithm can be used both for off-line analysis of historical alarm data and for on-line alarm filtering.

Key words: fault diagnosis, alarm filtering, alarm correlation, telecom network, data mining

中图分类号: