Journal of Computer Applications — official website


Text-to-SQL model based on semantic enhanced schema linking

WU Xianglan, XIAO Yang, LIU Mengying, LIU Mingming

  1. College of Software, Nankai University
  • Received: 2023-10-07 Revised: 2023-12-07 Online: 2024-03-15 Published: 2024-03-15
  • Corresponding author: LIU Mingming
  • About the authors: WU Xianglan, born in 2000, M. S. candidate. His research interests include natural language processing and SQL statement generation. XIAO Yang, born in 1998, M. S. candidate. His research interests include fake news detection and SQL statement generation. LIU Mengying, born in 1999, M. S. candidate. Her research interests include intelligent software engineering and fake news detection. LIU Mingming, born in 1979, Ph. D., lecturer. Her research interests include data mining.


Abstract: To optimize Text-to-SQL generation based on heterogeneous graph encoders, a model named SELSQL was proposed. Firstly, an end-to-end learning framework was adopted, and the Poincaré distance metric in hyperbolic space was used in place of the Euclidean metric, thereby optimizing the semantically enhanced schema linking graph constructed from a pre-trained language model with probing techniques. Secondly, K-head weighted cosine similarity and graph regularization methods were used to learn a similarity metric graph, so that the initial schema linking graph was iteratively optimized during training. Finally, an improved Relational Graph ATtention network (RGAT) graph encoder and a multi-head attention mechanism were used to encode the joint semantic schema linking graph of the two modules, and SQL statements were decoded with a grammar-based neural semantic decoder and a predefined structured language. Experimental results on the Spider dataset show that, with the ELECTRA-large pre-trained model, the accuracy of the SELSQL model is 2.5 percentage points higher than that of the best baseline model, with particularly large gains on complex SQL statement generation and better robustness in more general usage scenarios.
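The two graph-construction ingredients named in the abstract can be sketched numerically. The snippet below is an illustrative NumPy sketch, not the authors' implementation: it shows the standard Poincaré distance on the open unit ball, and one common reading of "K-head weighted cosine similarity" in graph structure learning (each head re-weights the node vectors element-wise before taking a cosine, and the K head scores are averaged); the function names and the per-head weight matrix `W` are assumptions for illustration.

```python
import numpy as np

def poincare_distance(u, v, eps=1e-12):
    """Poincaré (hyperbolic) distance between two points inside the unit ball:
    d(u, v) = arcosh(1 + 2*||u-v||^2 / ((1-||u||^2) * (1-||v||^2)))."""
    sq_dist = np.sum((u - v) ** 2)
    denom = (1.0 - np.sum(u ** 2)) * (1.0 - np.sum(v ** 2))
    return np.arccosh(1.0 + 2.0 * sq_dist / max(denom, eps))

def k_head_weighted_cosine(vi, vj, W, eps=1e-12):
    """Average cosine similarity over K heads; W has shape (K, d),
    one learnable weight vector per head (hypothetical parameterization)."""
    sims = []
    for w in W:
        a, b = w * vi, w * vj  # element-wise re-weighting for this head
        norm = max(np.linalg.norm(a) * np.linalg.norm(b), eps)
        sims.append(float(a @ b) / norm)
    return sum(sims) / len(sims)
```

For a point at radius r from the origin, the Poincaré distance to the origin reduces to ln((1+r)/(1-r)), so distances grow without bound as points approach the ball's boundary, which is what makes the metric well suited to hierarchy-like (tree-like) schema structures.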

Key words: schema linking, graph structure learning, pre-trained language model, Text-to-SQL, heterogeneous graph
