计算机应用 ›› 2015, Vol. 35 ›› Issue (10): 2896-2900.DOI: 10.11772/j.issn.1001-9081.2015.10.2896

• 信息安全 • 上一篇    下一篇

基于划分的二进制文件相似性比较方法

董启海, 王亚刚   

  1. 西安邮电大学 计算机学院, 西安 710121
  • 收稿日期:2015-05-22 修回日期:2015-08-18 出版日期:2015-10-10 发布日期:2015-10-14
  • 通讯作者: 董启海(1988-),男,山东高唐人,硕士研究生,主要研究方向:信息安全、嵌入式系统,dongqihaia@126.com
  • 作者简介:王亚刚(1972-),男,陕西乾县人,副教授,博士,CCF会员,主要研究方向:信息安全、嵌入式系统、编译器与并行计算。
  • 基金资助:
    国家自然科学基金资助项目(61136002);陕西省教育厅科研计划项目(14JK1674)。

Partition-based binary file similarity comparison method

DONG Qihai, WANG Yagang   

  1. School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an Shaanxi 710121, China
  • Received:2015-05-22 Revised:2015-08-18 Online:2015-10-10 Published:2015-10-14

摘要: 针对传统文件结构化相似性比较法中采用基本块(BB)一对一映射而造成的巨大时空消耗及基本块比较结果的绝对化问题,提出一种基于划分思想的文件结构化相似性比较方法。该方法首先对用于基本块比较的小素数积法进行改进,通过改进方法将函数内的基本块进行分类,再结合基本块签名与属性的权重求得基本块间的相似率,从而计算出最终的函数相似率及文件相似率。通过函数相似率比较实验分析,与未考虑划分思想的绝对化基本块比较算法相比,该方法在比较效率及准确率上均有所提升。实验结果表明,该方法在减少比较时间的同时提高了比较准确率,在实际二进制文件相似性比较的应用中更可行。

关键词: 划分, 基本块, 小素数积法, 签名, 二进制文件

Abstract: Focusing on the huge consumption of time and space and the absolute comparison result of Basic Block (BB) resulted from one-to-one mapping of basic block in traditional file structural similarity detection technique, a method based on dividing thought was proposed to accomplish structural comparison of binary file. Firstly, the small primes algorithm which is used to compare the basic block was improved to classify the basic blocks in a function, then the similar rates of basic blocks were obtained by combining with the weights of basic block signature and attribute, so as to get the final function similar rate and final file similar rate. In the comparison with the absolute comparison method which does not consider the partition at the similar rates of function, this proposed method has certain advantages in efficiency and accuracy. The experimental show that the proposed method improves the accuracy of comparison, and reduces the comparison time, it is more feasible in similarity comparison of binary files.

Key words: partition, Basic Block (BB), small primes algorithm, signature, binary file

中图分类号: