Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (12): 3839-3846.DOI: 10.11772/j.issn.1001-9081.2024121841

• Artificial intelligence • Previous Articles     Next Articles

CovMW-net: robust text matching method based on meta-weight network

Dongwei ZHANG1, Zheng YE1,2, Jun GE3   

  1. 1.Key Laboratory of Cyber-Physical Fusion Intelligent Computing,State Ethnic Affairs Commission (South-Central Minzu University),Wuhan Hubei 430074,China
    2.Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance,Ministry of Education (Minzu University of China),Beijing 100081,China
    3.College of International Business and Economics,Wuhan Textile University,Wuhan Hubei 430202,China
  • Received:2024-12-31 Revised:2025-02-24 Accepted:2025-04-07 Online:2025-06-05 Published:2025-12-10
  • Contact: Zheng YE
  • About author:ZHANG Dongwei, born in 1999, M. S. candidate. His research interests include natural language processing.
    YE Zheng, born in 1981, Ph. D., professor. His research interests include natural language processing.
    GE Jun, born in 1981, M. S., lecturer. Her research interests include natural language processing.
  • Supported by:
    Fundamental Research Funds for South-Central Minzu University(CZZ24009);Talent Introduction Funds for South-Central Minzu University(YZZ20001);Academic Innovation Team and Research Platform Project of South-Central Minzu University(XTZ24003)

基于元权重网络的鲁棒性文本匹配方法CovMW-net

张东伟1, 叶正1,2, 葛君3   

  1. 1.信息物理融合智能计算国家民委重点实验室(中南民族大学),武汉 430074
    2.教育部民族语言智能分析与安全治理重点实验室(中央民族大学),北京 100081
    3.武汉纺织大学 外经贸学院,武汉 430202
  • 通讯作者: 叶正
  • 作者简介:张东伟(1999—),男(苗族),湖南湘西人,硕士研究生,主要研究方向:自然语言处理
    叶正(1981—),男,湖北鄂州人,教授,博士,CCF会员,主要研究方向:自然语言处理
    葛君(1981—),女,湖北咸宁人,讲师,硕士,主要研究方向:自然语言处理。
  • 基金资助:
    中南民族大学基础研究基金资助项目(CZZ24009);中南民族大学人才引进计划基金资助项目(YZZ20001);中南民族大学学术创新团队及研究平台项目(XTZ24003);中南民族大学学术创新团队及研究平台项目(PTZ24001)

Abstract:

In text matching tasks, the complexity and diversity of textual data often lead to issues of lacking robustness during training. Traditional methods to address the lack of robustness, such as data augmentation and regularization, can be effective, but are often only applicable to specific types of noise or disturbances, and require a lot of computational resources. Therefore, a method based on Meta-Weight network (MW-net) — Meta-Weight network improved by the Covariance matrix (CovMW-net) was proposed. Firstly, the weight parameters and loss functions were adjusted by learning adaptively, thereby realizing rapid and reasonable weight distribution. Then, by controlling the weights of samples, the impacts of samples on training effects were magnified or diminished, and ultimately the training robustness was enhanced. The meta-learning framework of MW-net was inherited by CovMW-net, thereby saving computational resources. At the same time, by CovMW-net, through incorporating covariance matrices, deep feature extraction for samples in each category was conducted, and the covariance matrices of these features were calculated to measure minority class data, thereby mitigating the negative impacts of long-tail distributions caused by random sampling from meta-datasets in MW-net. Experimental results on the Clothing1M dataset show that CovMW-net outperforms the original method MW-net by 0.86 percentage points in accuracy and outperforms all comparative methods. In addition, on the Large-scale Chinese Question Matching Corpus (LCQMC) and Baidu Question-answer matching dataset (BQ), CovMW-net has the accuracy improvements between 4 and 6 percentage points mostly compared to the baseline. It can be seen that CovMW-net is effective in dealing with biases in meta-datasets and is feasible for application in research on the robustness of text matching.

Key words: text matching, robustness, meta-weight, covariance matrix, sample reweighting

摘要:

在文本匹配任务中,文本数据的复杂性与多样性使训练时常暴露出鲁棒性欠佳的问题。传统的解决文本鲁棒性不足的手段,诸如数据增强和正则化等,虽能发挥一定作用,但这些方法大多仅适用于特定类型的噪声或扰动,并且对计算资源的需求较高。因此,提出一种基于元权重网络(MW-net)的方法——协方差矩阵改进的元权重网络(CovMW-net)。首先,通过自适应学习调整权重参数与损失函数,从而迅速实现较合理的权重分配。其次,借助对样本权重的调控,放大或缩小样本在训练过程中对训练效果的影响,最终实现提升训练鲁棒性的目的。CovMW-net继承MW-net的元学习框架,进而节约计算资源。同时,它融合协方差矩阵,针对每个类别的样本开展深度特征提取,计算这些特征的协方差矩阵,并以此度量少数类数据,进而削减MW-net因为元数据集随机取样而产生长尾分布所造成的负面影响。在Clothing1M数据集上的实验结果表明,CovMW-net在准确率上超过原始方法MW-net 0.86个百分点,并优于所有对比方法。此外,在大规模中文问题匹配语料库(LCQMC)和百度问答匹配数据集(BQ)上CovMW-net的准确率相较于baseline提升大多在4~6个百分点。可见,CovMW-net在处理元数据集存在偏差时具备有效性,且应用于文本匹配鲁棒性研究时具有可行性。

关键词: 文本匹配, 鲁棒性, 元权重, 协方差矩阵, 样本重加权

CLC Number: