《计算机应用》唯一官方网站

• •    下一篇

融合情感词典的多视角语言特征方面情感三元组抽取模型

张正悦,彭菊红,丁子胥,范馨予,胡长玉   

  1. 湖北大学
  • 收稿日期:2025-01-03 修回日期:2025-03-30 发布日期:2025-04-16 出版日期:2025-04-16
  • 通讯作者: 彭菊红
  • 基金资助:
    湖北省重点研发项目

Aspect sentiment triplet extraction model with multi-view linguistic features and sentiment lexicon

  • Received:2025-01-03 Revised:2025-03-30 Online:2025-04-16 Published:2025-04-16

摘要: 在自然语言处理(NLP)任务中,方面情感三元组抽取(ASTE)旨在识别文本中方面词,观点词和情感极性之间的联系,是现实细粒度情感分析的关键步骤。在当前的主流方法中,端到端模型普遍存在对语言特征理解不足和情感表达稀疏性处理不佳的问题,限制了模型的准确性和鲁棒性,管道式模型存在传播错误问题。针对上述问题,提出一种融合情感词典的多视角语言特征方面情感三元组抽取模型(MVLF-SL)。多视角语言特征能够帮助模型理解上下文和隐含语义,情感词典能够提供额外的情感先验知识。首先,利用图卷积神经网络(GCN)对多视角的语言特征进行特征表达,得到增强的语言特征;其次,使用动态融合策略将增强的语言特征与情感词典融合;再次,利用多层GCN结合邻接关系和节点特征来增强方面词和观点词的特征表示;最后,利用双仿射注意力机制改进的边界驱动表格填充方法(BDTF)对三元组的解码和抽取。实验结果表明,在ASTE-DATA-V2数据集的四个子数据集14res,14lap,15res,16res上,与BDTF模型相比F1值提升0.57、2.08、2.2、1.93个百分点。可见,本文提出的模型能更好的对方面情感三元组进行抽取,充分利用语言特征和外部情感知识。

关键词: 方面情感三元组抽取, 句法依赖关系, 词性信息, 句法依赖距离, 图卷积网络

Abstract: In Natural Language Processing (NLP) tasks, Aspect Sentiment Triplet Extraction (ASTE) aims to identify therelationships among aspect terms, opinion terms, and sentiment polarity in text, serving as a crucial step in achieving fine-grained sentiment analysis. In current mainstream approaches, end-to-end models generally suffer from insufficient understanding of linguistic features and poor handling of the sparsity in sentiment expressions, which limits their accuracy and robustness. Pipeline models are prone to error propagation. To address these issues, Multi-View Language Feature-based Sentiment Lexicon (MVLF-SL) model was proposed. Multi-view linguistic features were utilized to enhance the model’s ability to understand context and implicit semantics, while a sentiment lexicon provided additional prior knowledge of sentiment. First, GraphConvolutional Networks (GCN) were used to represent multi-view linguistic features and obtain enhanced representations. Then, a dynamic fusion strategy was adopted to integrate the enhanced language features with sentiment lexicon information. After that, multi-layer GCNs were employed to further enhance the feature representations of aspect and opinion terms by incorporating adjacency relations and node features. Finally, a Boundary-DrivenTable-Filling (BDTF) method, improved with a biaffine attention mechanism, was used for decoding and extracting sentimenttriplets. Experimental results on four subsets (14res,14lap, 15res, and 16res) of the ASTE-DATA-V2 dataset show that, compared with the BDTF model, the F1 scores were improved by 0.57, 2.08, 2.20, and 1.93 percentage points, respectively. Theseresults indicate that the proposed model achieves better performance in aspect sentiment triplet extraction byeffectively leveraging linguistic features and external sentiment knowledge.

Key words: Aspect Sentiment Triplet Extraction(ASTE), syntactic dependency relations, Part-of-Speech Information, syntactic dependency distance, Graph Convolutional Network(GCN)

中图分类号: