Scientific document summarization model based on multi-graph neural network and graph contrastive learning

ZHAO Hongyan, GUO Lihua, LIU Chunxia, WANG Riyun

  School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China

  • Received: 2024-12-12  Revised: 2025-03-02  Online: 2025-03-13  Published: 2025-03-13
  • Corresponding author: LIU Chunxia
  • Supported by:
    the Natural Science Foundation of Shanxi Province; the Open Foundation of Key Laboratory of Shanxi Province; the Ph.D. Research Startup Foundation of Taiyuan University of Science and Technology

Abstract: Summarizing long documents is a challenging task in natural language processing because of the difficulty of capturing inter-sentence relationships, handling long-range dependencies, and encoding and extracting document information efficiently. Scientific documents, which typically consist of multiple sections and paragraphs with a complex hierarchical structure, make the task even more demanding. To address these challenges, a scientific document summarization model based on multi-graph neural networks and graph contrastive learning, named MGCSum, was proposed. First, for a given input document, intra-sentence and inter-sentence relationships were modeled with homogeneous and heterogeneous graph neural networks, respectively, to generate initial sentence representations. These representations were then fed into a multi-head hypergraph attention network, where self-attention was used to capture the relationships between nodes and hyperedges and to further update and refine cross-sentence representations. Next, a graph contrastive learning module was introduced to enhance global topic awareness and to improve the semantic consistency and discriminability of the sentence representations. Finally, a multi-layer perceptron and a normalization layer were applied to compute a score for each sentence that determines whether it is selected for the summary. Experimental results on the PubMed and ArXiv datasets show that MGCSum outperforms most of the baseline models. On PubMed, MGCSum achieves ROUGE-1, ROUGE-2, and ROUGE-L scores of 48.97%, 23.15%, and 44.09%, improvements of 0.2, 0.71, and 0.26 percentage points, respectively, over the state-of-the-art model HAESum. By combining multi-graph neural networks with graph contrastive learning, MGCSum captures hierarchical structure and inter-sentence relationships more effectively, improving the accuracy and semantic consistency of the generated summaries and demonstrating its advantage on scientific document summarization.
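To make the pipeline in the abstract concrete, the following is a minimal PyTorch sketch of the scoring stage: a multi-head hypergraph attention layer over sentence nodes, followed by an MLP-and-normalization head that outputs per-sentence selection scores. This is an illustrative reconstruction rather than the authors' implementation; the dimensions, the section-based hyperedges, the residual/LayerNorm placement, and the sigmoid at the output are all assumptions.

```python
# Minimal sketch of the scoring pipeline described in the abstract:
# sentence nodes exchange information with hyperedges (e.g., the sections
# that contain them), then an MLP with a sigmoid produces per-sentence
# selection scores. Illustrative only; not the authors' released code.
import torch
import torch.nn as nn

class HypergraphAttentionLayer(nn.Module):
    """One round of node -> hyperedge -> node multi-head attention."""
    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.node_to_edge = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.edge_to_node = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor, incidence: torch.Tensor) -> torch.Tensor:
        # x: (n, d) sentence representations; incidence: (n, m) binary
        # node-hyperedge membership (every sentence in >= 1 hyperedge).
        nodes = x.unsqueeze(0)                                 # (1, n, d)
        # initialize hyperedge features as the mean of member sentences
        deg = incidence.sum(dim=0).clamp(min=1).unsqueeze(-1)  # (m, 1)
        edges = ((incidence.t() @ x) / deg).unsqueeze(0)       # (1, m, d)
        # each hyperedge attends only to its member sentences
        edges, _ = self.node_to_edge(edges, nodes, nodes,
                                     attn_mask=(incidence.t() == 0))
        # each sentence attends only to the hyperedges it belongs to
        out, _ = self.edge_to_node(nodes, edges, edges,
                                   attn_mask=(incidence == 0))
        return out.squeeze(0)

class SentenceScorer(nn.Module):
    """Hypergraph attention followed by an MLP + normalization scoring head."""
    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.hgat = HypergraphAttentionLayer(dim, heads)
        self.norm = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(),
                                 nn.Linear(dim, 1))

    def forward(self, sents: torch.Tensor, incidence: torch.Tensor) -> torch.Tensor:
        h = self.norm(sents + self.hgat(sents, incidence))  # residual + LayerNorm
        return torch.sigmoid(self.mlp(h)).squeeze(-1)       # (n,) selection scores

# Toy usage: 6 sentences (d = 128) grouped into 2 section hyperedges.
x = torch.randn(6, 128)
inc = torch.zeros(6, 2)
inc[:3, 0], inc[3:, 1] = 1.0, 1.0
print(SentenceScorer(128)(x, inc))  # 6 scores in (0, 1)
```

In the full model, the input sentence representations would come from the homogeneous and heterogeneous GNN encoders described in the abstract, and hyperedges could group sentences by section, topic, or shared keywords.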

Key words: scientific document summarization, extractive summarization, Graph Neural Network (GNN), HyperGraph ATtention network (HGAT), Graph Contrastive Learning (GCL)
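The abstract attributes the gains in topic awareness and representation discriminability to a graph contrastive learning (GCL) module, but this page does not specify the objective. A common way to instantiate such a module is the NT-Xent (InfoNCE) loss between sentence representations computed from two augmented views of the document graph; the sketch below shows that standard formulation, with the temperature and the view construction as assumptions.

```python
# Hedged sketch of a graph contrastive objective in the spirit of the GCL
# module mentioned above. The paper's actual loss and view construction are
# not specified on this page, so this uses the standard NT-Xent (InfoNCE)
# loss over two augmented views (e.g., edge dropping) of the document graph.
import torch
import torch.nn.functional as F

def nt_xent(z1: torch.Tensor, z2: torch.Tensor,
            temperature: float = 0.5) -> torch.Tensor:
    """z1[i] and z2[i] are the same sentence under two graph views (positives);
    every other sentence in either view acts as a negative."""
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)  # (2n, d), unit norm
    sim = z @ z.t() / temperature                       # scaled cosine similarity
    sim.fill_diagonal_(float("-inf"))                   # never match a sample to itself
    # the positive for row i is row i + n, and vice versa
    targets = torch.cat([torch.arange(n) + n, torch.arange(n)])
    return F.cross_entropy(sim, targets)

# Toy usage: the same 6 sentences encoded under two graph augmentations.
loss = nt_xent(torch.randn(6, 128), torch.randn(6, 128))
print(loss.item())
```

In training, such a contrastive term would typically be added to the extractive sentence-selection loss with a weighting coefficient; the exact combination used by MGCSum is not stated on this page.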
