• •    

CCDM2022+223 J-SGPGN:基于序列与图的联合学习复述生成网络

侯志荣,范晓东,张华,马晓楠   

  1. 工银科技有限公司
  • 收稿日期:2022-05-05 修回日期:2022-05-13 发布日期:2022-06-29
  • 通讯作者: 侯志荣

J-SGPGN: Paraphrase Generation Networks based on Joint Learning of Sequence and Graph

  • Received:2022-05-05 Revised:2022-05-13 Online:2022-06-29

摘要: 复述生成是一种基于自然语言生成的文本数据增强方法。针对基于Seq2Seq框架的复述生成方法中出现的生成重复、语意错误及多样性差的问题,提出一种基于序列和图的联合学习复述生成网络(J-SGPGN)。首先,J-SGPGN的编码器融合了图编码和序列编码进行特征增强。然后,解码器设计了序列生成和图生成两种解码方式并行解码。最后,采用联合学习方法训练模型,旨在兼顾句法监督与语义监督同步提升生成的准确性和多样性。实验结果表明,J-SGPGN在Quora数据集上的生成准确性指标METEOR较准确性最优基线模型(RNN+GCN)提升了6.69%;生成多样性指标Self-BLEU较多样性最优基线模型(BTmPG)提升了3.75%。验证了J-SGPGN能够生成语义更准确,表达方式更多样的复述文本。

关键词: 复述生成, 编码器-解码器, 自注意力网络, 序列生成, 图生成, 联合学习

Abstract: Abstract: Paraphrase generation is a text data argumentation method based on natural language generation. Concerning the problems of repetitive generation, semantic errors and poor diversity in paraphrase generation methods based on the Seq2Seq framework, a paraphrase generation network based on joint learning of sequence and graph (J-SGPGN) was proposed. First, the encoder of J-SGPGN fused graph encoding and sequence encoding for feature enhancement. Then, the decoder was designed to contain two decoding methods, sequence generation and graph generation, for parallel decoding. Finally, the joint learning method was used to train the model, which aimed to simultaneously improve the accuracy and diversity of generation by taking into account both syntactic supervision and semantic supervision. The experimental results show that the generation accuracy evaluation indicator METEOR of J-SGPGN on the Quora dataset was 6.69% higher than the accuracy optimal baseline model (RNN+GCN). The generative diversity evaluation indicator Self-BLEU was 3.75% higher than the diversity optimal baseline model (BTmPG). It was verified that J-SGPGN could generate paraphrase texts with more accurate semantics and more diverse expressions.

Key words: paraphrase generation, encoder-decoder, self-attention network, sequence generation, graph generation, joint learning

中图分类号: