Journal of Computer Applications. DOI: 10.11772/j.issn.1001-9081.2024060893
FANG Yuhan1,2, YANG Fan3, ZHANG Qing4
Abstract: To address the problems of extraction position bias, answer redundancy, and insufficient sample data for pre-trained language models in extractive machine reading comprehension tasks, a machine reading comprehension model integrating dynamic interaction and contrastive learning was proposed. Firstly, the decoding layer of the pre-trained model was replaced with an interactive prediction layer, and dynamic self-attention and dynamic query mechanisms were introduced for answer prediction. Secondly, key positions were selected from the semantic vectors output by the pre-trained model using a TopK algorithm, and the features at these positions were enhanced with a multi-head self-attention mechanism. Then, a dynamic query vector was computed from the enhanced semantic vectors and the static query vector, and the answer prediction vector was output. Finally, in the loss computation stage, negative samples were constructed for contrastive learning, and a triplet loss was introduced to avoid overfitting. Experimental results show that on the CMRC2018 (Chinese Machine Reading Comprehension 2018) dataset, compared with the baseline model RoBERTa-wwm-ext-large (Robustly optimized BERT approach with Whole Word Masking, extended, large), the proposed method improves the F1 and EM (Exact Match) values by 1.82 and 1.29 percentage points, respectively; on the English SQuADv1.1 (Stanford Question Answering Dataset version 1.1) dataset, compared with the baseline model RoBERTa (Robustly optimized BERT approach), it improves the F1 and EM values by 1.18 and 0.58 percentage points, respectively, outperforming most existing machine reading comprehension models. These results verify the effectiveness and generalization of the proposed algorithm, which can accomplish more accurate and reliable reading comprehension tasks.
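The TopK key-position selection and multi-head feature enhancement described in the abstract can be sketched as follows. This is a minimal illustration based only on the abstract, not the authors' implementation: the function name `select_and_enhance`, the importance-score vector, the residual write-back, and the use of identity per-head projections are all simplifying assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def select_and_enhance(H, scores, k, num_heads=2):
    """Select the top-k key positions from the encoder output H
    (seq_len x d) by importance score, then enhance the features at
    those positions with multi-head self-attention over the selection."""
    idx = np.argsort(scores)[-k:]            # indices of the k highest-scoring positions
    K = H[idx]                               # (k, d) key-position features
    d_head = H.shape[1] // num_heads
    heads = []
    for h in range(num_heads):
        Q = K[:, h * d_head:(h + 1) * d_head]        # per-head slice (identity projections for brevity)
        attn = softmax(Q @ Q.T / np.sqrt(d_head))    # scaled dot-product attention weights
        heads.append(attn @ Q)
    enhanced = np.concatenate(heads, axis=-1)        # (k, d) re-assembled heads
    out = H.copy()
    out[idx] = out[idx] + enhanced           # residual update at the key positions only
    return out, idx
```

Only the selected positions are modified; the rest of the semantic vectors pass through unchanged, which matches the abstract's description of enhancing the features of the key positions.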
Key words: Machine Reading Comprehension, Pre-trained Models, Span Extraction, Dynamic Interaction, Triplet Loss
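The triplet loss over constructed negative samples mentioned in the abstract can be illustrated with a minimal sketch. The Euclidean distance and the `margin` default are assumptions for illustration; the abstract does not give the paper's exact formulation.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet loss: pull the anchor representation toward the positive
    (correct-answer) representation and push it away from a constructed
    negative sample by at least `margin`."""
    d_pos = np.linalg.norm(anchor - positive)   # distance to the positive sample
    d_neg = np.linalg.norm(anchor - negative)   # distance to the negative sample
    return max(d_pos - d_neg + margin, 0.0)     # hinge: zero once the gap exceeds margin
```

The hinge form means the loss vanishes once the negative is at least `margin` farther from the anchor than the positive, which limits over-optimization on already-separated samples.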
CLC Number: TP391.1
FANG Yuhan, YANG Fan, ZHANG Qing. Machine reading comprehension model integrating dynamic interaction and contrastive learning[J]. Journal of Computer Applications, 0, (): 0-0.
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2024060893