计算机应用 ›› 2014, Vol. 34 ›› Issue (11): 3283-3286.DOI: 10.11772/j.issn.1001-9081.2014.11.3283

• 计算机安全 • 上一篇    下一篇

基于熵的二叉树多类支持向量机的漏洞分类

张鹏,谢晓尧   

  1. 贵州省信息与计算科学重点实验室(贵州师范大学),贵阳 550001
  • 收稿日期:2014-05-26 修回日期:2014-06-26 出版日期:2014-11-01 发布日期:2014-12-01
  • 通讯作者: 张鹏
  • 作者简介:张鹏(1990-),男,江苏南京人,硕士研究生,主要研究方向:网络通信、信息安全;谢晓尧(1952-),男,贵州贵阳人,教授,博士,主要研究方向:网络通信、信息安全。
  • 基金资助:

    国家科技支撑计划项目

Vulnerability classification based on binary tree with entropy multi-class support vector machine

ZHANG Peng,XIE Xiaoyao   

  1. Key Laboratory of Information and Computing Science of Guizhou Province (Guizhou Normal University), Guiyang Guizhou 550001, China
  • Received:2014-05-26 Revised:2014-06-26 Online:2014-11-01 Published:2014-12-01
  • Contact: ZHANG Peng

摘要:

为了有效提高漏洞分类的准确性,针对基于二叉树多类支持向量机分类算法的分类复杂性和分类结果依赖二叉树的结构等缺点,提出了一种基于熵的二叉树多类支持向量机的漏洞分类算法。根据定义最小超球体进行漏洞样本空间的分类,有效地通过熵的计算来描述漏洞之间的混杂程度,使得漏洞分类的计算过程被简化且能够有效减少分类结果对二叉树结构的依赖。采用公共漏洞枚举(CWE)漏洞分类体系在收集到的3000个漏洞样本上进行大量仿真实验,漏洞分类的平均准确率和平均召回率达93.3%和93.25%,高于基于二叉树多类支持向量机分类算法和K-近邻(KNN)分类算法得到的平均值。实验结果表明所提算法有效可行,能精确地实现漏洞的分类。

Abstract:

To effectively improve the accuracy of the vulnerabilities' classification, the vulnerability classification algorithm based on binary tree with entropy multi-class support vector machine was proposed to solve the insufficiency of the classification algorithm based on binary tree multi-class support vector machine, which could effectively reduce the complexity of the classification and dependence on the structure of binary tree. According to defining the smallest hyper sphere to classify the vulnerability's sample space, and used the entropy to describe the confusion degree among vulnerabilities, thus the calculation process of vulnerability classification was simplified, and the classification results' dependence on the structure of the binary tree was reduced. Experiments were conducted on 3000 vulnerabilities' samples using Common Weakness Enumeration (CWE), the average vulnerability classification accuracy and the average recall rate respectively were 93.3% and 93.25%, which were higher than those of the classification algorithm based on the binary tree multi-class support vector machine and the classification algorithm based on K-Nearest Neighbor (KNN). The experimental results indicate that the proposed algorithm is practical and feasible, it can achieve the precise vulnerability classification.

中图分类号: