基于源码分析的自动化外部函数接口生成方法

doi:10.11772/j.issn.1001-9081.2023070968

《计算机应用》唯一官方网站

• • 下一篇

基于源码分析的自动化外部函数接口生成方法

孙蒴¹,张伟¹,冯温迪¹,张俞炜²

1. 北京信息科技大学
2. 北京大学

收稿日期:2023-07-19 修回日期:2023-09-19 发布日期:2023-10-26 出版日期:2023-10-26
通讯作者: 孙蒴

Automatic foreign function interface generation method based on source code analysis

Received:2023-07-19 Revised:2023-09-19 Online:2023-10-26 Published:2023-10-26

摘要/Abstract

摘要： 外部函数接口(FFI)是解决一种编程语言调用其他语言函数库的主要方法。针对使用FFI技术时需要大量人工编码的问题，提出了自动化外部函数接口生成方法(AFIG)。该方法利用基于抽象语法树的源码逆向分析技术，从被封装的库文件中精准提取出用于描述函数接口信息的多语言融合的统一表示。基于此统一表示，不同平台的代码生成器可利用多语言转换规则矩阵，全自动化地生成不同平台的FFI相关代码。为解决FFI代码生成中的效率低下问题，设计了一种基于依赖分析的任务聚合策略，通过把存在依赖的任务聚合为新的任务，有效消除了FFI代码任务在并行下的阻塞与死锁，从而实现任务在多核下的可扩展与负载均衡。实验结果表明：AFIG减少了FFI开发中98.14%的开发编码量以及41.95%的测试编码量；与现有的SWIG方法相比，在同等任务下可减少61.27%的开发成本，且使其生成效率随着计算资源的增加呈线性增长。

Abstract: Foreign Function Interface (FFI) is a fundamental method to invoke interfaces provided in another programming languages. Focusing on the issue that huge amount of manual coding was required when using FFI, Automatic a Foreign function Interface Generation (AFIG) was proposed. A reverse source code analysis technique based on the abstract syntax tree was employed by AFIG to accurately retrieve the multilingual intermediate representation from library binaries, in which function interface information was described. Based on the representation, the multilingual conversion rule matrix could be utilized by different platform code generators to automatically generate FFI codes for various platforms without handcrafting. To further reduce generation time usage, a dependency analysis-based task aggregation strategy was proposed, by which tasks with dependencies were consolidated as monolithic ones. Hence, blocking and deadlocks were efficiently eliminated, and load balancing and scalability on multi-core systems were achieved, accordingly. Experimental results indicate that a reduction of 98.14% for FFI manual codes and 41.95% for testing codes are achieved. Compared to SWIG, AFIG can further reduce development cost by 61.27% under the same task. Besides, the code generation mechanism is proven to be scalable because experimental results indicate a linear time reduction as the number of central processing unit cores increases.

中图分类号:

TP311.56

孙蒴张伟冯温迪张俞炜. 基于源码分析的自动化外部函数接口生成方法[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2023070968.

[1]	张家奇, 牟永敏, 张志华. 基于控制流的软件设计与实现一致性分析方法[J]. 计算机应用, 2020, 40(10): 3025-3033.
[2]	朱小杰, 赵子豪, 杜一. 模型驱动的大数据流水线框架PiFlow[J]. 计算机应用, 2020, 40(6): 1638-1647.
[3]	张文烨. 基于图像识别的移动端应用控件检测方[J]. 计算机应用, 0, (): 0-0.
[4]	王岩, 黄章进, 顾乃杰. 基于同余方程和改进的压扁控制流的混淆算法[J]. 计算机应用, 2017, 37(6): 1803-1807.
[5]	程勇, 秦丹, 杨光. 针对JavaScript浏览器兼容性的变异测试方法[J]. 计算机应用, 2017, 37(4): 1143-1148.
[6]	曹光辉, 李春强. 联合空域和小波域的图像加密[J]. 计算机应用, 2017, 37(2): 499-504.
[7]	徐远超, 孙凤芸, 闫俊峰, 万虎. 面向Android系统的目录自适应日志模式选择机制[J]. 计算机应用, 2015, 35(10): 3008-3012.
[8]	董跃华, 戴玉倩. 混合粒子群算法的软件测试数据自动生成[J]. 计算机应用, 2015, 35(2): 545-549.
[9]	赖春雷薛荷周益民. 视频移动终端实时定点与缩放[J]. 计算机应用, 2014, 34(7): 2028-2032.
[10]	张仕金尚赵伟. 基于区间集的Cppcheck数组边界缺陷检测[J]. 计算机应用, 2013, 33(11): 3257-3261.
[11]	范铁生张忠清孙静罗雪春陆贵强张璞. 云模型图像置乱算法[J]. 计算机应用, 2013, 33(09): 2497-2500.
[12]	周海光. 新一代多普勒天气雷达网探测数据对比分析系统[J]. 计算机应用, 2013, 33(01): 270-275.
[13]	陕光凌玲胡于进. 内存映射文件在提取有限元模态结果中的应用[J]. 计算机应用, 2012, 32(05): 1429-1431.
[14]	杨怡君黄大庆. Android手机自动化性能测试工具的研究与开发[J]. 计算机应用, 2012, 32(02): 554-556.
[15]	孙红利王忠民王文浪. 嵌入式软件语句覆盖率测试插桩技术[J]. 计算机应用, 2010, 30(10): 2738-2740.

基于源码分析的自动化外部函数接口生成方法

Automatic foreign function interface generation method based on source code analysis

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics