Static code defect detection method based on deep semantic fusion

doi:10.11772/j.issn.1001-9081.2021081548

Journal of Computer Applications, 2022, Vol. 42, Issue (10): 3170-3176.

Jingyun CHENG, Buhong WANG, Peng LUO

College of Information and Navigation, Air Force Engineering University, Xi'an, Shaanxi 710077, China

Received: 2021-08-31  Revised: 2021-11-20  Accepted: 2021-11-21  Online: 2022-01-07  Published: 2022-10-10
Corresponding author: Jingyun CHENG
About the authors: CHENG Jingyun (first contact), born in 1998 in Chongqing, M.S. candidate. His research interests include information security. E-mail: 1508458583@qq.com
WANG Buhong, born in 1975 in Taiyuan, Shanxi, Ph.D., professor. His research interests include information security, physical layer security, and artificial intelligence security.
LUO Peng, born in 1995 in Yancheng, Jiangsu, Ph.D. candidate. His research interests include information security.

Abstract

As computer software grows in scale and complexity, code defects in software have become a serious threat to public safety. To address the poor extensibility of static analysis tools, as well as the coarse detection granularity and unsatisfactory detection performance of existing methods, a static code defect detection method based on program slicing and semantic feature fusion is proposed. First, data flow and control flow analysis is performed on key points in the source code, and a slicing method based on Interprocedural Finite Distributive Subset (IFDS) is adopted to obtain code snippets composed of multiple lines of statements related to code defects. Then, semantically related vector representations of the code snippets are obtained by word embedding, so that an appropriate code snippet length can be selected while accuracy is preserved. Finally, a Text Convolutional Neural Network (TextCNN) and a Bi-directional Gated Recurrent Unit (BiGRU) are used to extract the local key features and the contextual sequence features of the code snippets, respectively, and the proposed method is used to detect slice-level code defects. Experimental results show that the proposed method can effectively detect different types of code defects and significantly outperforms the static analysis tool Flawfinder. At fine granularity, the IFDS slicing method further improves the F₁ score and accuracy, which reach 89.64% and 92.08%, respectively. Compared with existing methods based on program slicing, when the key points are Application Programming Interfaces (APIs) or variables, the proposed method achieves F₁ scores of 89.69% and 89.74% and accuracies of 92.15% and 91.98%, respectively. Thus, the proposed method offers better overall detection performance without significantly increasing time complexity.

Keywords: defect detection, program slicing, semantic analysis, deep learning, feature fusion
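
The abstract notes that code snippets are embedded as token sequences of a chosen length. A minimal sketch of that preprocessing step follows, assuming a simple regular-expression tokenizer and end-of-sequence padding; the names `tokenize`, `fit_length`, and the `<pad>` token are hypothetical and are not taken from the paper.

```python
import re
from typing import List

PAD = "<pad>"  # hypothetical padding token, not from the paper


def tokenize(code: str) -> List[str]:
    """Split a code snippet into identifiers, integer runs, and single symbols."""
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", code)


def fit_length(tokens: List[str], length: int, pad: str = PAD) -> List[str]:
    """Truncate or right-pad a token sequence to exactly `length` tokens."""
    if len(tokens) >= length:
        return tokens[:length]
    return tokens + [pad] * (length - len(tokens))


tokens = tokenize("strcpy(buf, str);")
fixed = fit_length(tokens, 10)
print(fixed)
```

A fixed length keeps every snippet the same shape for the downstream network; longer lengths preserve more context at the cost of more padding and computation.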

CLC number: TP393.08

Jingyun CHENG, Buhong WANG, Peng LUO. Static code defect detection method based on deep semantic fusion[J]. Journal of Computer Applications, 2022, 42(10): 3170-3176.

References (23)

1	ABU-DABASEH F, ALSHAMMARI E. Automated penetration testing: an overview[C]// Proceedings of the 4th International Conference on Natural Language Computing. Chennai, Tamil Nadu: AIRCC Publishing Corporation, 2018: 121-129. DOI: 10.5121/csit.2018.80610
2	LI Y, HUANG C L, WANG Z F, et al. Survey of software vulnerability mining methods based on machine learning[J]. Journal of Software, 2020, 31(7): 2040-2061.
3	SEMASABA A O A, ZHENG W, WU X X, et al. Literature survey of deep learning-based vulnerability analysis on source code[J]. IET Software, 2020, 14(6): 654-664. DOI: 10.1049/iet-sen.2020.0084
4	CVE Details. Browse vulnerabilities by date[EB/OL]. [2021-07-24].
5	YAMAGUCHI F. Pattern-based methods for vulnerability discovery[J]. it - Information Technology, 2017, 59(2): 101-106. DOI: 10.1515/itit-2016-0037
6	JIANG K L, BAI W, ZHANG L, et al. Malicious code detection based on multi-channel image deep learning[J]. Journal of Computer Applications, 2021, 41(4): 1142-1147.
7	KIM S, WOO S, LEE H, et al. VUDDY: a scalable approach for vulnerable code clone discovery[C]// Proceedings of the 2017 IEEE Symposium on Security and Privacy. Piscataway: IEEE, 2017: 595-614. DOI: 10.1109/sp.2017.62
8	GRIECO G, GRINBLAT G L, UZAL L, et al. Toward large-scale vulnerability discovery using machine learning[C]// Proceedings of the 6th ACM Conference on Data and Application Security and Privacy. New York: ACM, 2016: 85-96. DOI: 10.1145/2857705.2857720
9	SCANDARIATO R, WALDEN J, HOVSEPYAN A, et al. Predicting vulnerable software components via text mining[J]. IEEE Transactions on Software Engineering, 2014, 40(10): 993-1006. DOI: 10.1109/tse.2014.2340398
10	MIRSKY Y, DEMONTIS A, KOTAK J, et al. The threat of offensive AI to organizations[EB/OL]. (2021-06-30) [2021-07-26].
11	RUSSELL R, KIM L, HAMILTON L, et al. Automated vulnerability detection in source code using deep representation learning[C]// Proceedings of the 2018 17th IEEE International Conference on Machine Learning and Applications. Piscataway: IEEE, 2018: 757-762. DOI: 10.1109/icmla.2018.00120
12	ZHOU Y Q, LIU S Q, SIOW J, et al. Devign: effective vulnerability identification by learning comprehensive program semantics via graph neural networks[C/OL]// Proceedings of the 33rd Conference on Neural Information Processing Systems. [2021-07-27].
13	XU J, CHEN P H, XIONG J B. Code vulnerability detection model based on sliding window and hash function[J]. Application Research of Computers, 2021, 38(8): 2394-2400.
14	LI Z, ZOU D Q, XU S H, et al. VulDeePecker: a deep learning-based system for vulnerability detection[EB/OL]. (2018-01-05) [2021-07-27]. DOI: 10.14722/ndss.2018.23158
15	LI Y C, CUI Y Q, LYU J F, et al. Combined deep learning method for open source software vulnerability detection[J]. Computer Engineering and Applications, 2019, 55(11): 52-59.
16	WANG X M, GUAN Z B, XIN W, et al. Source code defect detection using deep convolutional neural networks[J]. Journal of Tsinghua University (Science and Technology), 2021, 61(11): 1267-1272.
17	LI X, WANG L, XIN Y, et al. Automated vulnerability detection in source code using minimum intermediate representation learning[J]. Applied Sciences, 2020, 10(5): No.1692. DOI: 10.3390/app10051692
18	JEON S, KIM H K. AutoVAS: an automated vulnerability analysis system with a deep learning approach[J]. Computers and Security, 2021, 106: No.102308. DOI: 10.1016/j.cose.2021.102308
19	CHANDRA A, SINGHAL A, BANSAL A. A study of program slicing techniques for software development approaches[C]// Proceedings of the 1st International Conference on Next Generation Computing Technologies. Piscataway: IEEE, 2015: 622-627. DOI: 10.1109/ngct.2015.7375196
20	MIKOLOV T, CHEN K, CORRADO G, et al. Efficient estimation of word representations in vector space[EB/OL]. (2013-09-07) [2021-07-29]. DOI: 10.3126/jiee.v3i1.34327
21	KIM Y. Convolutional neural networks for sentence classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA: Association for Computational Linguistics, 2014: 1746-1751. DOI: 10.3115/v1/d14-1181
22	National Institute of Standards and Technology. Software assurance reference dataset[DS/OL]. [2021-08-02]. DOI: 10.1109/dasc.2007.4391957
23	WHEELER D A. Flawfinder[EB/OL]. [2017-08-26].

Variable	Backward	Forward
argc@main	{14}	{6, 8, 9, 10, 11, 12, 17, 19, 20, 21, 22}
argv@main	{14}	{9, 19}
buf@test	{6, 8, 14, 17, 20}	{8}
str@test	{6, 14, 17, 19, 20}	{9}
userstr@main	{14, 17, 19}	{19}
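
The backward and forward line sets in the table above are the kind of output a slicer produces for each key point. As a rough illustration only, the sketch below computes both slices by plain reachability over a dependence graph. This is a simplification and not the IFDS algorithm used in the paper; the graph `DEPENDENTS` and its line numbers are invented for the example and do not correspond to the program behind the table.

```python
from collections import deque
from typing import Dict, Set

Graph = Dict[int, Set[int]]


def reachable(graph: Graph, start: int) -> Set[int]:
    """Return every node reachable from `start` (including `start`) via BFS."""
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def reverse(graph: Graph) -> Graph:
    """Flip every edge so that traversal follows dependences backwards."""
    rev: Graph = {}
    for src, dsts in graph.items():
        for dst in dsts:
            rev.setdefault(dst, set()).add(src)
    return rev


# Hypothetical dependence edges: line -> lines that depend on it.
DEPENDENTS: Graph = {1: {3}, 2: {3}, 3: {5}, 4: {5}, 5: {6}}

key_line = 3
forward_slice = reachable(DEPENDENTS, key_line)            # lines affected by line 3
backward_slice = reachable(reverse(DEPENDENTS), key_line)  # lines that line 3 depends on
print(sorted(backward_slice), sorted(forward_slice))
```

Taking the union of the two sets gives a bidirectional slice around the key point.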

Actual \ Predicted	Vulnerable	Non-vulnerable
Vulnerable	TP	FN
Non-vulnerable	FP	TN
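
The four confusion-matrix cells above determine the metrics reported in the results tables (F₁, Acc, Rec, Pre). The sketch below uses the standard definitions of these metrics; the function names and the example counts are illustrative and are not taken from the paper.

```python
from typing import Dict


def metrics(tp: int, fn: int, fp: int, tn: int) -> Dict[str, float]:
    """Compute precision, recall, accuracy, and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "accuracy": accuracy, "f1": f1}


def f1_from(precision: float, recall: float) -> float:
    """F1 is the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Illustrative counts only.
m = metrics(tp=90, fn=10, fp=20, tn=80)
print(m)

# Consistency check against the reported buffer overflow results
# (Pre = 84.05, Rec = 89.29): the harmonic mean comes out near 86.59.
print(round(f1_from(84.05, 89.29), 2))
```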

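Parameter	Value	Parameter	Value
Number of filters (N)	128	Epochs	20
Convolution window size (m)	1, 3, 5	Activation function	ReLU
GRU units (u)	50	Convolution mode	MaxPooling1D
Fully connected layer units	484	Optimizer	Adamax
Dropout	0.5	Loss function	categorical_crossentropy
Batch size	256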
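
The fully connected layer size of 484 in the table above is consistent with concatenating the TextCNN and BiGRU feature vectors. The arithmetic sketch below assumes that each filter contributes one max-pooled value per window size and that the BiGRU outputs the concatenated final states of both directions; this reading is an inference from the listed values, not something the source states, and `fused_dim` is a hypothetical helper.

```python
from typing import Sequence

N_FILTERS = 128           # filters per convolution window size
WINDOW_SIZES = (1, 3, 5)  # convolution window sizes
GRU_UNITS = 50            # units per GRU direction


def fused_dim(n_filters: int, window_sizes: Sequence[int], gru_units: int) -> int:
    """Dimension of the fused vector under the assumptions stated above."""
    textcnn_dim = n_filters * len(window_sizes)  # one pooled value per filter per window
    bigru_dim = 2 * gru_units                    # forward + backward final states
    return textcnn_dim + bigru_dim


print(fused_dim(N_FILTERS, WINDOW_SIZES, GRU_UNITS))  # 484
```

Under these assumptions, 3 × 128 = 384 local features plus 2 × 50 = 100 sequence features give the 484-dimensional fused representation.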

Defect type	F₁/%	Acc/%	Rec/%	Pre/%
Buffer overflow	86.59	89.04	89.29	84.05
Format string	89.86	91.88	91.21	88.55
Memory management	88.06	90.79	91.74	84.67
Improper error handling	89.95	93.57	93.14	86.98
Command execution	95.49	94.38	97.73	93.35
Mixed	96.13	96.59	97.69	94.62

Slice type	Time/s	Tokens	Reuse ratio/%	F₁/%	Acc/%
IFDS_Bo	757	378	72.88	89.64	91.94
IFDS_Bw	634	233	80.03	89.51	92.08
IFDS_Fw	671	351	93.91	43.67	65.40
SDG_Bo	737	378	73.00	89.50	91.92
SDG_Bw	652	233	80.19	89.36	92.01
SDG_Fw	695	351	93.91	45.80	65.04
Weiser_Bw	949	257	84.01	54.11	69.59

Detection approach	Model	Key point type	Tokens	Embedding method	F₁/%	Acc/%	Average training time per batch/ms	Average detection time per sample/ms
Rule-based	Flawfinder [23]	-	-	-	38.01	58.65	-	0.126
Deep learning-based	DCnnGRU [15]	API	363	Skip-gram	85.89	88.38	54.77	0.129
	TextCNN+SVM [17]	API	363	CBOW	89.90	92.34	26.76+2 227.20	2.446
	BiGRU [18]	API	363	FastText	88.53	91.09	35.34	0.106
	Proposed model	API	363	Skip-gram	89.69	92.15	62.11	0.167
Proposed method	Proposed model	Variable	378	Skip-gram	89.74	91.98	61.81	0.189
	DCnnGRU	Variable	378	Skip-gram	87.68	89.31	55.68	0.139
	TextCNN+SVM	Variable	378	Skip-gram	89.35	92.17	27.06+2 059.67	2.509
	BiGRU	Variable	378	Skip-gram	88.79	91.16	35.87	0.117
