Journal of Computer Applications, 2026, Vol. 46, Issue (3): 723-731. DOI: 10.11772/j.issn.1001-9081.2025040454
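Received: 2025-04-25
Revised: 2025-06-11
Accepted: 2025-06-12
Online: 2025-06-23
Published: 2026-03-10
Corresponding author: Zhe CUI
About the author: WU Dingjia, born in 1999 in Bazhong, Sichuan, M. S. candidate. His research interests include natural language processing and large language models.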
Dingjia WU, Zhe CUI
Abstract:
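To address the limitations of Large Language Models (LLMs) in generating Structured Query Language (SQL) for complex multi-table databases, a Text-to-SQL framework based on multi-generator collaboration, MG-SQL (Multi-Generator SQL), is proposed. First, to reduce the noise introduced by irrelevant schema information, an enhanced schema linking optimization method is proposed that generates an initial SQL and combines it with semantic similarity retrieval. Second, to improve the quality and diversity of candidate SQLs, a multi-strategy collaborative generation framework is built on the pruned schema: 1) an experience generator retrieves dynamic examples; 2) a chain-of-thought generator strengthens logical reasoning; 3) a query plan generator simulates the execution flow of the database; 4) a progressive generator performs iterative refinement. Third, a voting mechanism selects the best SQL among the candidates. Finally, a reflective learning mechanism is proposed, which forms reflection samples by comparing generated results with reference SQLs and dynamically builds a domain experience base to enable continual learning. Results on the BIRD benchmark show that, with the lightweight GPT-4o-mini model, the schema linking of the proposed framework achieves a Strict Recall Rate (SRR) of 98.89% while filtering out 44.91% of irrelevant columns; the SQL generated by the framework reaches an Execution Accuracy (EX) of 69.69% and a Valid Efficiency Score (VES) of 79.59%, surpassing mainstream methods based on GPT-4o and verifying the effectiveness of the framework in complex scenarios.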
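The voting step can be illustrated with execution-based majority voting: each candidate SQL is run against the database, candidates are grouped by their result set, and a query from the largest group is returned. This is only a minimal sketch under stated assumptions. The abstract does not say how votes are counted, so counting over execution results, the function name `vote_by_execution`, the toy `customers` table, and the tie-breaking rule are all hypothetical.

```python
import sqlite3
from collections import defaultdict


def vote_by_execution(conn, candidates):
    """Return a candidate whose execution result is shared by the most candidates.

    Assumption: votes are counted over execution results, ignoring row order;
    the source only states that a voting mechanism is used.
    """
    groups = defaultdict(list)
    for sql in candidates:
        try:
            rows = conn.execute(sql).fetchall()
        except sqlite3.Error:
            continue  # candidates that fail to execute receive no vote
        groups[frozenset(rows)].append(sql)
    if not groups:
        return None
    # Largest group wins; on a tie, the group that appeared first is kept.
    best_group = max(groups.values(), key=len)
    return best_group[0]


# Toy database, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (CustomerID INTEGER, Currency TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "EUR"), (2, "EUR"), (3, "CZK"), (4, "CZK"), (5, "CZK")],
)

candidates = [
    "SELECT COUNT(*) FROM customers WHERE Currency = 'EUR'",
    "SELECT SUM(Currency = 'EUR') FROM customers",
    "SELECT COUNT(*) FROM customers",  # wrong: filter is missing
    "SELECT COUNT(*) FROM customer",   # fails: table does not exist
]
winner = vote_by_execution(conn, candidates)
```

Here the first two candidates agree on the result, so one of them is selected even though their SQL text differs; the failing candidate is simply ignored.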
Cite this article: Dingjia WU, Zhe CUI. MG-SQL: SQL generation framework with enhanced schema linking and multi-generator collaboration[J]. Journal of Computer Applications, 2026, 46(3): 723-731.
| Method | Model | VES | EX (simple)/% | EX (moderate)/% | EX (challenging)/% | EX (overall)/% |
|---|---|---|---|---|---|---|
| XiYan-SQL | Fine-tuned model | — | — | — | — | 73.34 |
| GPT-4 | GPT-4 | — | — | — | — | 46.35 |
| DIN-SQL | GPT-4 | 58.79 | — | — | — | 50.72 |
| DAIL-SQL | GPT-4 | 56.08 | — | — | — | 54.76 |
| TA-SQL | GPT-4 | — | 63.14 | 48.60 | 36.11 | 56.19 |
| MAC-SQL | GPT-4 | 66.39 | 65.73 | 52.69 | 40.28 | 59.39 |
| ROUTE | Qwen2.5-14B | 65.20 | — | — | — | 60.90 |
| MCS-SQL | GPT-4 | 64.80 | 70.40 | 53.10 | 51.40 | 63.36 |
| CHESS | Proprietary | — | — | — | — | 65.00 |
| E-SQL | GPT-4o | — | — | — | — | 65.58 |
| LPE-SQL | Llama-3.1-70B | — | 72.11 | 59.70 | 51.03 | 66.36 |
| RSL-SQL | GPT-4o | 70.32 | 74.38 | 57.11 | 53.79 | 67.21 |
| Distillery | GPT-4o | — | — | — | — | 67.21 |
| OpenSearch-SQL | GPT-4o | — | — | — | — | 69.30 |
| E-SQL | GPT-4o-mini | — | 68.00 | 53.23 | 47.59 | 61.60 |
| MG-SQL | GPT-4o-mini | 79.59 | 76.00 | 60.78 | 57.93 | 69.69 |
Tab. 1 Comparison of execution accuracy (EX) and valid efficiency score (VES) of different methods and models on the BIRD development set
| Method | Candidate generation | Selection strategy | EX/% |
|---|---|---|---|
| XiYan-SQL | ICL + fine-tuning | Fine-tuned model | 73.34 |
| XiYan-SQL (w/o fine-tuned generation) | ICL | Fine-tuned model | 68.67 |
| XiYan-SQL (w/o selection model) | ICL + fine-tuning | — | 68.84 |
| CHESS (Proprietary) | ICL | Voting | 65.00 |
| CHESS (Gemini-1.5-pro) | ICL | Unit tests | 68.31 |
| OpenSearch-SQL | ICL | Voting | 69.30 |
| MG-SQL | ICL | Voting | 69.69 |
Tab. 2 Comparison of execution accuracy among methods using different candidate generation
| Method | NSR/% | SRR/% | Avg. tables | Avg. columns |
|---|---|---|---|---|
| Full Schema | 100.00 | 100.00 | 7.44 | 76.28 |
| Gold-Based | 100.00 | 100.00 | 1.94 | 4.50 |
| MCS-SQL | — | 89.80 | — | — |
| CHESS | 94.00 | 89.70 | 1.92 | 4.47 |
| RSL-SQL | 98.27 | 92.52 | 5.71 | 18.86 |
| MG-SQL | 99.68 | 98.89 | 7.08 | 42.02 |
Tab. 3 Comparison of schema linking results of different methods on the BIRD development set
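This page does not define NSR and SRR, so the sketch below assumes the definitions commonly used in the schema linking literature: NSR (non-strict recall) is the share of gold columns that are recalled, pooled over all questions, and SRR (strict recall rate) is the share of questions for which every gold column is recalled. The function name and the toy inputs are hypothetical. The last line reproduces the column reduction implied by the averages in Tab. 3.

```python
def schema_linking_recall(gold, linked):
    """Compute (NSR, SRR) in percent.

    gold, linked: lists of sets of "table.column" strings, one set per question.
    Assumed definitions (not stated on this page):
      NSR = recalled gold columns / all gold columns, pooled over questions
      SRR = questions with every gold column recalled / all questions
    """
    total_gold = sum(len(g) for g in gold)
    recalled = sum(len(g & l) for g, l in zip(gold, linked))
    strict_hits = sum(1 for g, l in zip(gold, linked) if g <= l)
    nsr = 100.0 * recalled / total_gold
    srr = 100.0 * strict_hits / len(gold)
    return nsr, srr


# Toy example: the second question misses one gold column.
gold = [
    {"customers.Currency"},
    {"Patient.ID", "Patient.SEX", "Laboratory.ID", "Laboratory.Date"},
]
linked = [
    {"customers.Currency", "customers.CustomerID"},
    {"Patient.ID", "Patient.SEX", "Laboratory.ID"},
]
nsr, srr = schema_linking_recall(gold, linked)

# Column reduction implied by Tab. 3: the full schema averages 76.28 columns,
# MG-SQL keeps 42.02 columns on average.
reduction = 100.0 * (1 - 42.02 / 76.28)
```

In the toy example 4 of 5 gold columns are recalled, but only 1 of 2 questions is fully covered, which is why SRR is the stricter metric. The computed reduction rounds to 44.91%, the share of filtered columns reported in the abstract.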
| Setting | EX (simple) | EX (moderate) | EX (challenging) | EX (overall) |
|---|---|---|---|---|
| Multi-generator | 76.00 | 60.78 | 57.93 | 69.69 |
| w/o experience generator | 74.59 | 58.41 | 53.10 | 67.67 |
| w/o chain-of-thought generator | 74.27 | 60.13 | 54.48 | 68.12 |
| w/o query plan generator | 74.92 | 61.42 | 54.48 | 68.90 |
| w/o progressive generator | 75.68 | 60.78 | 53.79 | 69.10 |
Tab. 4 Ablation experimental results of multiple generators on the BIRD development set (execution accuracy, %)
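The ablation can be read as the drop in overall execution accuracy when one generator is removed. The short sketch below only redoes that arithmetic from the overall column of Tab. 4; the English setting names are translations and the variable names are illustrative.

```python
# Overall EX (%) from Tab. 4.
full_overall = 69.69
ablations = {
    "w/o experience generator": 67.67,
    "w/o chain-of-thought generator": 68.12,
    "w/o query plan generator": 68.90,
    "w/o progressive generator": 69.10,
}

# Drop in overall EX (percentage points) caused by removing each generator.
drops = {name: round(full_overall - ex, 2) for name, ex in ablations.items()}

# The setting with the largest drop marks the generator that contributes most.
largest = max(drops, key=drops.get)
```

By this measure, removing the experience generator costs the most overall accuracy and removing the progressive generator the least.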
| Setting | EX (simple) | EX (moderate) | EX (challenging) | EX (overall) |
|---|---|---|---|---|
| Multi-generator | 76.00 | 60.78 | 57.93 | 69.69 |
| Experience generator | 71.24 | 57.11 | 51.72 | 65.12 |
| Chain-of-thought generator | 69.84 | 54.53 | 46.21 | 62.97 |
| Query plan generator | 72.11 | 56.03 | 51.03 | 65.25 |
| Progressive generator | 68.00 | 50.22 | 51.72 | 61.08 |
Tab. 5 Comparison of execution accuracy of different generators (%)
| Case | Difficulty | Correct case | PREDICTED_SQL | GOLD_SQL |
|---|---|---|---|---|
| 1 | Simple | Question: What is the ratio of customers who pay in EUR against customers who pay in CZK? Evidence: ratio of customers who pay in EUR against customers who pay in CZK = count(Currency = 'EUR') / count (Currency = 'CZK'). | SELECT COUNT(CASE WHEN Currency = 'EUR' THEN 1 END) * 1.0 / COUNT(CASE WHEN Currency = 'CZK' THEN 1 END) AS ratio FROM customers | SELECT CAST(SUM(IIF(Currency = 'EUR', 1, 0)) AS FLOAT) / SUM (IIF(Currency = 'CZK', 1, 0)) AS ratio FROM customers |
| 2 | Moderate | Question: For the year that concluded on December 31, 1998, how many male patients on average were tested in the lab each month? Evidence: the year that concluded on December 31, 1998 refers to Date BETWEEN '1998-01-01' AND '1998-12-31'; male refers to SEX = 'M'; calculation = DIVIDE(COUNT(ID), 12) | SELECT CAST(COUNT(Laboratory.ID) AS REAL) / 12 FROM Laboratory INNER JOIN Patient ON Laboratory.ID = Patient.ID WHERE Laboratory.Date BETWEEN '1998-01-01' AND '1998-12-31' AND Patient.SEX = 'M' | SELECT CAST(COUNT(T1.ID) AS REAL) / 12 FROM Patient AS T1 INNER JOIN Laboratory AS T2 ON T1.ID = T2.ID WHERE STRFTIME('%Y', T2.Date) = '1998' AND T1.SEX = 'M' |
| 3 | Challenging | Question: Please list the patient's ID if he or she has an abnormal level of red blood cell and is followed at the outpatient clinic. Evidence: RBC < = 3.5 or RBC > = 6.0 means the patient has an abnormal level of red blood cell; 3.5 < RBC < 6.0 means the patient has a normal level of red blood cell; followed at the outpatient clinic refers to Admission = '-' | SELECT P.ID FROM Patient P JOIN Laboratory L ON P.ID = L.ID WHERE (L.RBC <= 3.5 OR L.RBC >= 6.0) AND P.Admission = '-' | SELECT DISTINCT T1.ID FROM Patient AS T1 INNER JOIN Laboratory AS T2 ON T1.ID = T2.ID WHERE (T2.RBC <= 3.5 OR T2.RBC >= 6) AND T1.Admission = '-' |
Tab. 6 Comparison of generated SQLs on the BIRD development set
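The predicted and gold queries in Tab. 6 differ textually, yet they count as correct because execution accuracy compares query results and not query strings. The sketch below assumes that results are compared as sets of rows, so order and duplicate rows are ignored. That comparison rule, the helper name `execution_match`, and the toy data are assumptions, not details taken from this page. It replays case 3, where the predicted SQL lacks `DISTINCT`.

```python
import sqlite3


def execution_match(conn, predicted_sql, gold_sql):
    """True if both queries return the same set of rows (order and duplicates ignored)."""
    predicted = set(conn.execute(predicted_sql).fetchall())
    gold = set(conn.execute(gold_sql).fetchall())
    return predicted == gold


# Toy data: patient 1 has two abnormal RBC measurements.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Patient (ID INTEGER, Admission TEXT);
CREATE TABLE Laboratory (ID INTEGER, RBC REAL);
INSERT INTO Patient VALUES (1, '-'), (2, '+'), (3, '-');
INSERT INTO Laboratory VALUES (1, 3.2), (1, 6.4), (2, 3.0), (3, 4.5);
""")

predicted_sql = (
    "SELECT P.ID FROM Patient P JOIN Laboratory L ON P.ID = L.ID "
    "WHERE (L.RBC <= 3.5 OR L.RBC >= 6.0) AND P.Admission = '-'"
)
gold_sql = (
    "SELECT DISTINCT T1.ID FROM Patient AS T1 INNER JOIN Laboratory AS T2 "
    "ON T1.ID = T2.ID WHERE (T2.RBC <= 3.5 OR T2.RBC >= 6) AND T1.Admission = '-'"
)

same = execution_match(conn, predicted_sql, gold_sql)
predicted_rows = conn.execute(predicted_sql).fetchall()
```

On this toy data the predicted query returns patient 1 twice while the gold query returns it once, and the set comparison still reports a match. Under a stricter, duplicate-sensitive comparison the same pair would be judged different.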
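| Question ID | Question summary | Error type | Incorrect SQL (core fragment) | Correct SQL (core fragment) | Error analysis |
|---|---|---|---|---|---|
| 2 | Query the zip codes of all charter schools under the Fresno education office | C1 | SELECT DISTINCT schools.MailZip … | SELECT T2.Zip … (T2 is the alias of schools) | Selected the MailZip column instead of the Zip column |
| 16 | Count the merged schools in Alameda with fewer than 100 test takers | C3 | The condition StatusType = 'Merged' is not included | … WHERE T1.StatusType = 'Merged' … | A key filter condition is missing |
| 30 | Query the 5 cities with the lowest K-12 enrollment | C4 | … ORDER BY f.'Enrollment (K-12)' ASC | … GROUP BY T2.City ORDER BY SUM(T1.'Enrollment (K-12)') ASC | Missing grouping by city and summation of enrollment |
| 24 | Query the names of schools that meet conditions including a meal ratio calculation | C5 | (frpm.'Free Meal Count (K-12)' / frpm.'Enrollment (K-12)') >= 0.1 | CAST(T2.'Free Meal Count (K-12)' AS REAL) / T2.'Enrollment (K-12)' > 0.1 | Missing type cast causes integer division; wrong comparison operator |
| 193 | Query the bond types in which chlorine participates | C2 | … atom AS T1 INNER JOIN bond AS T2 ON T1.molecule_id = T2.molecule_id … | … bond AS T1 INNER JOIN connected AS T2 ON T1.bond_id = T2.bond_id INNER JOIN atom AS T3 ON T2.atom_id = T3.atom_id … | Missing the key table connected, which links atoms and bonds |
| 349 | Query the 5 drivers eliminated in the first qualifying round | C6 | … ORDER BY Q.q1 LIMIT 5 (fastest 5) | … ORDER BY T1.q1 DESC LIMIT 5 (slowest 5) | Misunderstood the meaning of "eliminated" (should be the slowest times) |
| 28 | Query the average difference between K-12 enrollment and ages 5-17 enrollment for locally funded schools | C7 | Subquery: … FROM frpm WHERE frpm.'Charter Funding Type' = 'Locally funded' | Subquery: … FROM frpm AS T3 INNER JOIN schools AS T4 ON T3.CDSCode = T4.CDSCode WHERE T4.FundingType = 'Locally funded' | The subquery does not join the schools table to filter by FundingType and wrongly uses a field of the frpm table |
Tab. 7 Analysis of incorrectly generated SQLs on the BIRD development set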
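The C5 error for question 24 comes from integer division: in SQLite, dividing one INTEGER value by another truncates the result, so a ratio below 1 evaluates to 0 unless one operand is cast to REAL. The sketch below shows the effect on a toy `frpm` table. The single row of values is made up for illustration, and the column names are quoted with double quotes, which is standard SQL identifier quoting and differs from the quoting shown in the table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE frpm ("Free Meal Count (K-12)" INTEGER, "Enrollment (K-12)" INTEGER)'
)
conn.execute("INSERT INTO frpm VALUES (30, 200)")  # true ratio is 0.15

# INTEGER / INTEGER truncates in SQLite.
truncated = conn.execute(
    'SELECT "Free Meal Count (K-12)" / "Enrollment (K-12)" FROM frpm'
).fetchone()[0]

# Casting one operand to REAL gives floating-point division.
ratio = conn.execute(
    'SELECT CAST("Free Meal Count (K-12)" AS REAL) / "Enrollment (K-12)" FROM frpm'
).fetchone()[0]

# Effect on the filter: without the cast the row is silently dropped.
passes_without_cast = conn.execute(
    'SELECT COUNT(*) FROM frpm '
    'WHERE "Free Meal Count (K-12)" / "Enrollment (K-12)" > 0.1'
).fetchone()[0]
passes_with_cast = conn.execute(
    'SELECT COUNT(*) FROM frpm '
    'WHERE CAST("Free Meal Count (K-12)" AS REAL) / "Enrollment (K-12)" > 0.1'
).fetchone()[0]
```

The query without the cast raises no error; it just returns fewer rows, which is why this class of mistake only shows up when execution results are compared.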