Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (3): 723-731. DOI: 10.11772/j.issn.1001-9081.2025040454

• Artificial Intelligence •


MG-SQL: SQL generation framework with enhanced schema linking and multi-generator collaboration

Dingjia WU1,2, Zhe CUI1()   

  1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, Sichuan 610213, China
    2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2025-04-25 Revised:2025-06-11 Accepted:2025-06-12 Online:2025-06-23 Published:2026-03-10
  • Contact: Zhe CUI
  • About author: WU Dingjia, born in 1999 in Bazhong, Sichuan, M. S. candidate. His research interests include natural language processing and large language models.
  • Supported by:
    Natural Science Foundation of Sichuan Province (2024NSFSC0004)


Abstract:

To address the limitations of Large Language Models (LLMs) in generating Structured Query Language (SQL) for complex multi-table database scenarios, a Text-to-SQL framework based on multi-generator collaboration, MG-SQL (Multi-Generator SQL), was proposed. Firstly, to mitigate the noise interference caused by irrelevant schema information, an enhanced schema linking optimization method was proposed, in which initial SQL queries were generated and combined with semantic similarity-based retrieval. Secondly, to improve the quality and diversity of candidate SQL queries, a multi-strategy collaborative generation framework was built on the basis of the refined schema: 1) an experience generator was used to retrieve dynamic examples; 2) a chain-of-thought generator was used to strengthen logical reasoning; 3) a query plan generator was used to simulate database execution flows; 4) a progressive generator was used to perform iterative optimization. Thirdly, the optimal SQL was selected through a voting mechanism. Finally, a reflective learning mechanism was further proposed, in which the generated results were compared with reference SQL to form reflective samples, and a domain experience base was constructed dynamically for continuous learning. Results on the BIRD benchmark demonstrate that, with the lightweight GPT-4o-mini model, the schema linking of the proposed framework achieves a Strict Recall Rate (SRR) of 98.89% while effectively filtering out 44.91% of irrelevant columns; the SQL generated by the proposed framework achieves an EXecution accuracy (EX) of 69.69% and a Valid Efficiency Score (VES) of 79.59%, outperforming mainstream GPT-4o-based approaches and validating the effectiveness of the proposed framework in complex scenarios.
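The candidate-generation and voting stage described above can be illustrated with a minimal, hypothetical sketch. The four candidate queries below stand in for the outputs of the experience, chain-of-thought, query-plan, and progressive generators (which in MG-SQL would each be a separately prompted LLM call); candidates are grouped by execution result and one member of the largest group is returned, a common execution-based voting scheme assumed here for illustration. The toy `emp` table and all identifiers are invented for this example.

```python
# Sketch of multi-candidate SQL selection by execution-result voting.
# The generator outputs are simulated as fixed strings; nothing here
# reproduces MG-SQL's actual prompts or models.
import sqlite3
from collections import Counter

def run_query(conn, sql):
    """Execute a candidate SQL; return its result set, or None on error."""
    try:
        return tuple(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None

def select_by_voting(conn, candidates):
    """Group candidates by execution result and return the first candidate
    belonging to the largest group (majority voting over results)."""
    valid = {sql: r for sql in candidates
             if (r := run_query(conn, sql)) is not None}
    if not valid:
        return None
    winner_result, _ = Counter(valid.values()).most_common(1)[0]
    for sql in candidates:  # preserve generator order among the winners
        if valid.get(sql) == winner_result:
            return sql

# Toy database plus four simulated generator outputs.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp(name TEXT, dept TEXT, salary INT)")
conn.executemany("INSERT INTO emp VALUES (?,?,?)",
                 [("a", "IT", 90), ("b", "IT", 80), ("c", "HR", 70)])
candidates = [
    # "experience" generator: ORDER BY + LIMIT formulation
    "SELECT name FROM emp WHERE dept = 'IT' ORDER BY salary DESC LIMIT 1",
    # "chain-of-thought" generator: explicit MAX subquery
    "SELECT name FROM emp WHERE dept = 'IT' AND salary = "
    "(SELECT MAX(salary) FROM emp WHERE dept = 'IT')",
    # "query plan" generator: drops the dept filter
    "SELECT name FROM emp ORDER BY salary DESC LIMIT 1",
    # "progressive" generator: draft with a typo, fails to execute
    "SELECT nme FROM emp",
]
best = select_by_voting(conn, candidates)
```

Here the first three candidates all return the same row, so they form the majority group and the first of them is selected, while the non-executable draft is discarded automatically.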

Key words: schema linking, Large Language Model (LLM), Text-to-Structured Query Language (Text-to-SQL), retrieval augmentation, In-Context Learning (ICL)
