计算机应用 ›› 2010, Vol. 30 ›› Issue (1): 181-185.

• 信息安全 • 上一篇    下一篇

多类支持向量机的病毒行为检测方法

韩兰胜1,邹梦松2,刘其文3,刘铭3   

  1. 1. 华中科技大学
    2. 华中科技大学 Huazhong University of Science & Technology
    3.
  • 收稿日期:2009-07-07 修回日期:2009-08-19 发布日期:2010-01-01 出版日期:2010-01-01
  • 通讯作者: 邹梦松
  • 基金资助:
    网络病毒的追踪与寻源技术研究;计算机病毒求源与追踪技术模型;移动自组网中数据可信传输的关键技术研究

Behavior-based virus detection method using multi-class support vector machine

  • Received:2009-07-07 Revised:2009-08-19 Online:2010-01-01 Published:2010-01-01
  • Contact: Meng-Song ZOU

摘要: 与正常程序相比,病毒具有一些特殊的、有限的行为。运用支持向量机的方法,构建出病毒的特征行为空间,采用信息熵来放大病毒行为与正常程序的区别,通过学习分类寻找并建立将不同程序行为切分的超平面,再对不同类型病毒的特征行为进行区分。通过对大量正常程序与病毒程序中的API调用的统计和分析,发现了病毒的API调用数量和分布的特征,将行为特征集中API调用序列设定为2100就可以将所实验的病毒检出,这保证了检测集的稳定性和检测的可行性。与已有的病毒检测方法进行比较,所提出的方法更加具有操作性。

关键词: 计算机病毒, 行为检测, 信息熵, 支持向量机, 多类分类

Abstract: In order to achieve specific functions, computer viruses are of some special behaviors different from those of the normal programs. Appling Support Vector Machine (SVM), the paper created a space of virus API feature vector and amplified the difference between normal programs and computer virus with the help of information entropy. By training a classifier, a hyperplane was found, which could divide the API space into two parts, each of which represented one kind of the programs. Moreover, the paper collected behaviors of different kinds of viruses. Through statistics, analysis and calculation on amount of samples' API calls, the amount and distribution patterns of APIs were exposed. As most viruses' behaviors are finite, the paper set 2100 as the length of API sequence, thus detecting most test viruses. Compared with previous virus detection methods, the proposed method is more practical.

Key words: Computer virus, Behavior-based detection, Information entropy, Support Vector Machine(SVM), Multi-class classification