• •    

CCF BigData2023+P00114+基于语境增强的新能源汽车投诉文本方面-观点对抽取

汪才钦,周渝皓,张顺香,王琰慧,王小龙   

  1. 安徽理工大学
  • 收稿日期:2023-08-31 修回日期:2023-09-12 发布日期:2023-12-18
  • 通讯作者: 张顺香
  • 基金资助:
    国家自然科学基金面上项目;安徽高校协同创新项目

Aspect-opinion pair extraction of new energy vehicle complaint text based on context enhancement

  • Received:2023-08-31 Revised:2023-09-12 Online:2023-12-18
  • Contact: Shun-xiang ZHANG
  • Supported by:
    The National Natural Science Foundation of China; the University Synergy Innovation Program of Anhui Province

摘要: 挖掘新能源汽车投诉文本中用户对产品多维度的意见,能够为产品的设计决策提供参考。因投诉文本具有实体密度高、句式冗长等特点,导致当前方面-观点对抽取方法感知方面项与观点项间的关联性不强。针对这一问题,提出一种基于语境增强的方面-观点对抽取模型(AOE-CE),通过融合主题特征与文本特征作为语境表示增强实体间的关联关系。模型由实体识别和关系检测两个模块组成,实体识别先通过预训练模型和词性标注工具编码文本,再利用双向长短期记忆网络(Bi-LSTM)结合多头注意力捕获上下文信息得到文本特征,并将其输至条件随机场(CRF)中得到实体集合。关系检测通过BERT获取主题特征,并将其与文本特征融合获得增强的语境表示,再利用三仿射机制以语境表示为辅助提升实体间的关联关系,最后通过Sigmoid得到抽取结果。实验结果显示,所提方法的准确率、召回率和F1值较SDRN模型分别提升了2.19%、1.08%和1.60%。

关键词: 方面-观点对抽取, 新能源汽车投诉文本, 语境增强, 三仿射机制, 多头注意力

Abstract: Mining users' multi-dimensional opinions on products in the complaint texts of new energy vehicles can provide support for product design decisions. Considering that the complaint text has the characteristics of high entity density and lengthy sentence structure, the existing methods for aspect-opinion pair extraction suffer from weak correlations between aspect terms and opinion terms. To locate this problem, this paper proposes an Aspect-Opinion pair Extraction model based on Context Enhancement(AOE-CE), fusing topic features and text features as contextual representation to enhance the correlation between entities. This model consists of an entity recognition module and relation detection module. Firstly, The entity recognition module encodes the text using a pre-trained model and part-of-speech tagging tool. Secondly, it employs Bi-directional Long Short-Term Memory (Bi-LSTM) combined with multi-head attention to capture contextual information and derive text features. Subsequently, These features are input into a Conditional Random Fields (CRF) model to obtain the entity set. The relationship detection module obtains the topic features through BERT, and fuses them with the text features to obtain the enhanced contextual representation. Then use the tri-affine mechanism to improve the relationship between entities with the help of contextual representation. Finally, the extraction result is obtained by Sigmoid. Meanwhile, the experimental result shows the accuracy rate, recognition rate and F1 value of the proposed method are 2.19%, 1.08% and 1.60% higher than the SDRN model respectively.

Key words: aspect-opinion pair extraction, the complaint texts of new energy vehicles, context enhancement, the tri-affine mechanism, multi-head attention

中图分类号: