Long document summarization feasibility with document compression and entity prompt mechanism

doi:10.11772/j.issn.1001-9081.2025101263

Journal of Computer Applications (official website)

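LI Zigu (李咨谷)¹, CHEN Jingqiang (陈景强)¹,²*

1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023;
2. Jiangsu Key Laboratory of Big Data Security and Intelligent Processing (Nanjing University of Posts and Telecommunications), Nanjing 210023

Received: 2025-10-28  Revised: 2026-01-26  Accepted: 2026-01-28  Online: 2026-02-06  Published: 2026-02-06
Corresponding author: CHEN Jingqiang
Funding: National Natural Science Foundation of China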

Abstract

Abstract: Long-document summarization remains challenging because the documents are long and structurally complex. Most existing generative frameworks are susceptible to noise during training, so irrelevant semantic information interferes with the acquisition of useful knowledge. To mitigate this issue, a method integrating document compression and entity prompting was proposed to filter irrelevant content and reinforce the learning of key information. An end-to-end framework, DCEPSum (Document Compression and Entity Prompted Summarization), was developed, in which "soft-selection" document compression was combined with entity prompting to generate coherent summaries. First, in the document compression module, a heterogeneous graph was constructed over sentence, entity, and section nodes, and a multi-head graph attention network was employed to aggregate contextual information and output sentence-importance weights; high-weight sentences were retained to reduce noise and strengthen cross-sentence associations. Next, to guide the model toward key entity information and further reduce noise interference, a key entity selection and entity prompting mechanism was introduced. During generation, within an encoder architecture extended by a state space model, the selected entities were inserted into the input context as prefix prompts, and the final summary was generated. Experimental results on two benchmark datasets show that DCEPSum achieves ROUGE-2 (Recall-Oriented Understudy for Gisting Evaluation) scores of 23.18 and 20.86 and ROUGE-L scores of 46.63 and 45.83; compared with the strongest baseline, LSG (Local, Sparse and Global attention) (16k), ROUGE-2 increases by 0.76 and 0.67, and ROUGE-L increases by 2.31 and 3.14, respectively. Combining document compression with entity prompting improves long-document summarization quality at a controllable computational cost and offers a feasible approach to long-context summarization modeling.
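The compress-then-prompt flow described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the sentence weights are hard-coded placeholders standing in for the graph attention output, a hard top-k cut is used as a simplification of the paper's "soft selection" (whose details the abstract does not give), and the `Entities: ...` prefix format, the `keep_ratio` parameter, and all function names are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Sentence:
    text: str
    weight: float  # placeholder importance weight; the paper derives it from a graph attention network


def compress(sentences, keep_ratio=0.5):
    """Keep the highest-weighted sentences, preserving document order.

    Hypothetical hard top-k cut; it only approximates the idea of
    retaining high-weight sentences.
    """
    if not sentences:
        return []
    k = max(1, math.ceil(len(sentences) * keep_ratio))
    ranked = sorted(range(len(sentences)),
                    key=lambda i: sentences[i].weight,
                    reverse=True)
    kept_indices = sorted(ranked[:k])  # restore original order
    return [sentences[i] for i in kept_indices]


def build_input(sentences, entities, keep_ratio=0.5, sep=" | "):
    """Prepend the selected entities as a prefix prompt to the compressed text.

    The prefix layout is an assumption, not a format taken from the paper.
    """
    kept = compress(sentences, keep_ratio)
    prefix = "Entities: " + sep.join(entities)
    return prefix + "\n" + " ".join(s.text for s in kept)


if __name__ == "__main__":
    document = [
        Sentence("The framework builds a heterogeneous graph.", 0.8),
        Sentence("Funding details are listed in the appendix.", 0.1),
        Sentence("Selected entities are inserted as prefix prompts.", 0.9),
    ]
    print(build_input(document, ["heterogeneous graph", "prefix prompt"]))
```

In a real system the compressed text and entity prefix would be tokenized and passed to the summarization model; the sketch stops at building the input string.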

Key words: long document summarization, document compression, entity prompt, graph neural network (GNN), state space model (SSM)
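The abstract reports DCEPSum's scores and its gains over LSG (16k) but not the baseline's own scores. Assuming the gains are absolute differences in ROUGE points (the abstract does not say so explicitly), the implied baseline values can be recovered by subtraction. The abstract does not name the two datasets, so they are labeled by position only, and `implied_baseline` is a hypothetical helper.

```python
from decimal import Decimal

# (DCEPSum score, reported gain over LSG 16k), in the order given in the abstract.
reported = {
    "ROUGE-2": [("23.18", "0.76"), ("20.86", "0.67")],
    "ROUGE-L": [("46.63", "2.31"), ("45.83", "3.14")],
}


def implied_baseline(score, gain):
    """Baseline score implied by a reported score and an absolute gain (assumption)."""
    return Decimal(score) - Decimal(gain)


baselines = {
    metric: [implied_baseline(score, gain) for score, gain in pairs]
    for metric, pairs in reported.items()
}

for metric, values in baselines.items():
    for position, value in enumerate(values, start=1):
        print(f"{metric}, dataset {position}: implied LSG (16k) score = {value}")
```

`Decimal` is used instead of floats so that the two-decimal subtraction is exact.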

CLC number: TP391.1

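LI Zigu, CHEN Jingqiang. Long document summarization feasibility with document compression and entity prompt mechanism[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025101263.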

