Unit test generation method via path constraint sharding driven LLM

doi:10.11772/j.issn.1001-9081.2025091085

Journal of Computer Applications

XU Xiaolong¹, WANG Junfeng¹,², WU Peng³, CAO Xiansheng⁴

1. College of Computer Science, Sichuan University; 2. National Key Laboratory of Fundamental Science on Synthetic Vision (Sichuan University); 3. School of Artificial Intelligence, Sichuan Tourism University; 4. School of Cyber Science and Engineering, Sichuan University

Received: 2025-09-18  Revised: 2025-09-30  Online: 2025-10-30  Published: 2025-10-30
About the authors: XU Xiaolong, born in 2000, M.S. candidate. His research interests include large language models and automatic test case generation. WANG Junfeng, born in 1976, Ph.D., research fellow. His research interests include network and information security, new technologies for industrial software, and space information networks. WU Peng, born in 1982, Ph.D., lecturer. His research interests include software supply chain security and software testing. CAO Xiansheng, born in 1991, Ph.D. candidate. His research interests include source code vulnerability analysis and deep learning.
Supported by: National Natural Science Foundation of China (U24B20147, U2133208); Major Science and Technology Special Project of Sichuan Province (2024ZHCG0195, 2024ZDZX0044, 2024ZYD0269)

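Corresponding author: WANG Junfeng
Author profiles: XU Xiaolong (born 2000), male, from Urumqi, Xinjiang, M.S. candidate; main research interests: large language models and automatic test case generation. WANG Junfeng (born 1976), male, from Wuhu, Anhui, research fellow, Ph.D.; main research interests: network and information security and new technologies for industrial software. WU Peng (born 1982), male, from Chengdu, Sichuan, lecturer, Ph.D.; main research interests: software supply chain security and software testing. CAO Xiansheng (born 1991), male, from Heze, Shandong, Ph.D. candidate; main research interests: source code vulnerability analysis and deep learning.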

Abstract

Automated unit test generation is key to improving development efficiency and ensuring software quality in modern software development. Large Language Models (LLMs) have been applied to automatic test case generation because of their good code understanding ability; however, when dealing with complex functions, they struggle to cover deep branch paths. Therefore, the PyULLM method was proposed, which combines path constraint sharding with the generation ability of LLMs to address this problem. Specifically, all path constraints were collected systematically by a preorder traversal of the Abstract Syntax Tree (AST). On this basis, a fine-grained mapping between code lines and path constraints was established, and the set of path constraints was intelligently partitioned into shards according to this mapping. This sharding mechanism enables the LLM to focus on specific path constraints, which significantly improves the coverage of the generated unit test cases in complex scenarios. Experimental results show that, compared with the state-of-the-art tool Pynguin, PyULLM improves line coverage by 24.16 percentage points and branch coverage by 26.61 percentage points. Compared with the current state-of-the-art CODAMOSA method, PyULLM improves coverage by 19.06 percentage points and branch coverage by 21.72 percentage points. The results show that PyULLM can effectively generate unit test cases for complex functions.

Key words: unit test generation, large language model, path constraint fragmentation, testing and analysis, test coverage
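
The pipeline summarized in the abstract (preorder AST traversal, a line-to-path-constraint mapping, then sharding of the constraint set) can be illustrated with a minimal sketch. This is not PyULLM's implementation, which this page does not describe in detail. The function names, the fixed-size sharding rule, the prompt wording, and the sample function are all assumptions made for illustration, and early returns are deliberately not modeled. It uses only Python's standard `ast` module and needs Python 3.9+ for `ast.unparse`.

```python
import ast
from collections import defaultdict

# Hypothetical function under test (not taken from the paper).
SAMPLE = '''\
def classify(x, y):
    if x > 0:
        if y > 10:
            return "A"
        return "B"
    elif x == 0:
        return "C"
    return "D"
'''

def collect_line_constraints(source: str) -> dict[int, tuple[str, ...]]:
    """Preorder walk over statements: record, for each statement line,
    the branch conditions that enclose it (its path constraint)."""
    line_to_path: dict[int, tuple[str, ...]] = {}

    def visit(stmts, path):
        for stmt in stmts:
            line_to_path[stmt.lineno] = path  # visit the node before its children
            if isinstance(stmt, ast.If):
                cond = ast.unparse(stmt.test)
                visit(stmt.body, path + (cond,))
                visit(stmt.orelse, path + (f"not ({cond})",))
            else:
                # Simplification: other compound statements add no constraint.
                for field in ("body", "orelse"):
                    visit(getattr(stmt, field, []), path)

    visit(ast.parse(source).body, ())
    return line_to_path

def shard_paths(line_to_path, shard_size=2):
    """Group lines by path constraint, then split the distinct constraints
    into fixed-size shards (an assumed rule, chosen for simplicity)."""
    lines_by_path = defaultdict(list)
    for line, path in sorted(line_to_path.items()):
        if path:  # skip unconditional lines
            lines_by_path[path].append(line)
    paths = list(lines_by_path.items())
    return [paths[i:i + shard_size] for i in range(0, len(paths), shard_size)]

def build_prompt(source: str, shard) -> str:
    """Render one shard as a focused request; the wording is illustrative."""
    wanted = "\n".join(
        f"- reach line(s) {lines} with: {' and '.join(path)}" for path, lines in shard
    )
    return f"Write pytest tests for this function:\n{source}\nCover these paths:\n{wanted}"

if __name__ == "__main__":
    mapping = collect_line_constraints(SAMPLE)
    for shard in shard_paths(mapping):
        print(build_prompt(SAMPLE, shard))
        print("-" * 40)
```

Each prompt lists only a few path constraints, so the model is asked to satisfy a small, explicit set of branch conditions rather than reason about the whole function at once.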

XU Xiaolong, WANG Junfeng, WU Peng, CAO Xiansheng. Unit test generation method via path constraint sharding driven LLM[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025091085.
