基于自适应敏感区域变异的覆盖引导模糊测试

• •

基于自适应敏感区域变异的覆盖引导模糊测试

徐航¹,杨智²,陈性元²,韩冰¹,杜学绘³,⁴

1. 战略支援部队信息工程大学
2. 信息工程大学
3. 信息工程大学，郑州 450001；
4. 数学工程与先进计算国家重点实验室，郑州 450001；

收稿日期:2023-08-31 修回日期:2023-10-30 发布日期:2023-12-18
通讯作者: 杨智
基金资助:
国家自然科学基金

Coverage-guided Fuzzing Based on Adaptive Sensitive Region Mutation

Received:2023-08-31 Revised:2023-10-30 Online:2023-12-18

摘要/Abstract

摘要： 针对覆盖引导的模糊测试（CGF）中存在大量无效变异且造成性能浪费的问题，提出了一种自适应敏感区域变异算法。首先根据变异出的测试用例是否执行新路径将对应的变异位置分为有效变异位置集合和无效变异位置集合，然后基于有效变异位置确定敏感区域，并将后续的变异集中在敏感区域内。在后续的模糊测试过程中，根据测试用例的执行结果自适应地调整对应种子的敏感区域，实现减少无效变异的目的。此外，设计了新的种子选择策略来配合敏感区域变异。将敏感区域算法集成到AFL上，并将其命名为Sensitive-region-based Mutation American Fuzzy Lop（SMAFL）。在12个流行的应用程序上对SMAFL进行评估，结果表明在相同的时间内，SMAFL平均比AFL多发现了39.3%的路径，SMAFL的模糊次数为AFL的3至4倍，并且SMAFL在12个程序中都实现了更高的代码覆盖率。在对LAVA-M的测试中，SMAFL发现了更多的bug，并且发现bug所用时间更短。

关键词: 模糊测试, 自适应算法, 软件漏洞, 代码覆盖率, 变异

Abstract: Abstract: To solve the problem that there are a lot of invalid mutations, and the performance is wasted in Coverage-guided Fuzzing (CGF), an adaptive sensitive region mutation algorithm is proposed. Firstly, the mutation locations are divided into effective mutation location set and invalid mutation location set according to whether the mutated test case executed a new path. Then, the sensitive region is determined based on the effective mutation location, and the subsequent mutation is concentrated in the sensitive region. In the subsequent fuzzing process, the sensitive region of the corresponding seed is adjusted adaptively according to the execution result of the test case, so as to reduce the invalid mutations. In addition, a new seed selection strategy was designed to assistant the sensitive region mutation algorithm. The adaptive sensitive region mutation algorithm was integrated into the AFL and named Sensitive-region-based Mutation American Fuzzy Lop (SMAFL). SMAFL was evaluated on 12 popular applications and the results show that, on average, SMAFL found 39.3% more paths than AFL in the same amount of time, SMAFL fuzzed three to four times more than AFL, and SMAFL achieved higher code coverage across all 12 programs. In testing LAVA-M, SMAFL found more bugs and found them in less time.

Key words: fuzzing, adaptive algorithm, software vulnerability, code coverage, mutation

中图分类号:

TP393.08

徐航杨智陈性元韩冰杜学绘. 基于自适应敏感区域变异的覆盖引导模糊测试[J]. 计算机应用.

[1]	田甜, 邵阳阳, 王苗苗, 杨欢. 基于程序依赖关系的变异体生成策略[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2863-2870.
[2]	徐航, 杨智, 陈性元, 韩冰, 杜学绘. 基于自适应敏感区域变异的覆盖引导模糊测试[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2528-2535.
[3]	杨乐, 张达敏, 何庆, 邓佳欣, 左锋琴. 改进猎人猎物优化算法在WSN覆盖中的应用[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2506-2513.
[4]	李牧, 骆宇, 柯熙政. 基于调频连续波雷达的人体生命体征检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1978-1986.
[5]	蔡锦辉, 尹中旭, 宗国笑, 李俊儒. 面向嵌套分支突破的推断与污点分析融合的方法[J]. 《计算机应用》唯一官方网站, 2024, 44(12): 3823-3830.
[6]	刘羿希, 何俊, 吴波, 刘丙童, 李子玉. DevSecOps中软件安全性测试技术综述[J]. 《计算机应用》唯一官方网站, 2024, 44(11): 3470-3478.
[7]	李大海, 詹美欣, 王振东. 基于多个改进策略的增强麻雀搜索算法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2845-2854.
[8]	高昊, 张庆科, 卜降龙, 李俊青, 张化祥. 基于协同变异与莱维飞行策略的教与学优化算法及其应用[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1355-1364.
[9]	张玉杰, 王帆. 基于改进麻雀搜索算法的照明控制优化[J]. 《计算机应用》唯一官方网站, 2023, 43(3): 835-841.
[10]	张仲华, 赵福媛, 郭钧枫, 赵高长. 柯西自适应回溯搜索与最小二乘支持向量机的集成预测模型[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1829-1836.
[11]	陈亮, 汤显峰. 改进正余弦算法优化特征选择及数据分类[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1852-1861.
[12]	倪萍, 陈伟. 基于模糊测试的反射型跨站脚本漏洞检测[J]. 计算机应用, 2021, 41(9): 2594-2601.
[13]	付安兵, 魏文红, 张宇辉, 郭文静. 基于准反向变异的实数笛卡尔遗传编程算法[J]. 计算机应用, 2021, 41(2): 479-485.
[14]	李丽荣, 杨坤, 王培崇. 融合头脑风暴思想的教与学优化算法[J]. 计算机应用, 2020, 40(9): 2677-2682.
[15]	郭秀婷, 朱昶胜, 张生财, 赵奎鹏. 分形插值在风速时间序列中的应用[J]. 计算机应用, 2020, 40(9): 2628-2633.