《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (1): 37-44.DOI: 10.11772/j.issn.1001-9081.2021122099

• 人工智能 • 上一篇    下一篇

嵌入不同邻域表征的方面级情感分析模型

刘欢1, 窦全胜1,2   

  1. 1.山东工商学院 计算机科学与技术学院,山东 烟台 264005
    2.山东省高等学校协同创新中心:未来智能计算,山东 烟台 264005
  • 收稿日期:2021-12-17 修回日期:2022-06-22 发布日期:2022-09-23
  • 通讯作者: 窦全胜(1971—),男,黑龙江大庆人,教授,博士,CCF会员,主要研究方向:数据挖掘、自然语言处理、深度学习。li_dou@163.com
  • 作者简介:刘欢(1994—),女,陕西咸阳人,硕士研究生,主要研究方向:自然语言处理;窦全胜(1971—),男,黑龙江大庆人,教授,博士,CCF会员,主要研究方向:数据挖掘、自然语言处理、深度学习;
  • 基金资助:
    国家自然科学基金资助项目(61976125, 61976124)。

Aspect-based sentiment analysis model embedding different neighborhood representations

LIU Huan1, DOU Quansheng1,2   

  1. 1.School of Computer Science and Technology, Shandong Technology and Business University, Yantai Shandong 264005, China
    2.Co?innovation Center of Shandong Colleges and Universities: Future Intelligent Computing, Yantai Shandong 264005, China
  • Received:2021-12-17 Revised:2022-06-22 Online:2022-09-23
  • Contact: DOU Quansheng, born in 1971, Ph. D., professor. His research interests include data mining, natural language processing, deep learning.
  • About author:LIU Huan, born in 1994, M. S. candidate. Her research interests include natural language processing;DOU Quansheng, born in 1971, Ph. D., professor. His research interests include data mining, natural language processing, deep learning;
  • Supported by:
    This work is partially supported by National Natural Science Foundation of China (61976125, 61976124).

摘要: 方面级情感分析(ABSA)任务旨在识别特定方面的情感极性,然而现有的相关模型对结构不定的自然语句缺少对方面词上下文的短距离约束,且容易忽略句法关系,因而难以准确判定方面的情感极性。针对上述问题,提出嵌入不同邻域表征(EDNR)的ABSA模型。在该模型中,在获得句子语序信息的基础上,采用近邻策略并结合卷积神经网络(CNN)获取方面的邻域信息,减少较远无关信息对模型的影响;同时,引入语句的语法信息,增加单词之间的依赖关系;将上述两种特征融合后,使用Mask与注意力机制来特别关注方面信息,减少无用信息对情感分析模型的干扰。此外,为评价上下文和语法信息对情感极性的影响程度,提出一个信息评估系数。在5个公共数据集上进行实验的结果表明,与情感分析模型聚合图卷积网络-最大值函数(AGCN-MAX)相比,EDNR模型在数据集14Lap上的正确率和F1值分别提升了2.47和2.83个百分点。由此可见,EDNR模型可以有效捕获情感特征,提高分类性能。

关键词: 方面级情感分析, 邻域表征, 情感极性, 近邻策略, 信息评估系数

Abstract: The Aspect-Based Sentiment Analysis (ABSA) task aims to identify the sentiment polarity of a specific aspect. However, the existing related models lack the short-distance constraints on the context of the aspect word for the natural sentences with uncertain structure, and easily ignore the syntactic relations, so it is difficult to accurately determine the sentiment polarity of the aspect. Aiming at the above problems, an ABSA model with Embedding Different Neighborhood Representations (EDNR) was proposed. In this model, on the basis of obtaining the word order information of sentences, the nearest neighbor strategy combining with Convolution Neural Network (CNN) was used to obtain aspect neighborhood information, so as to reduce the influence of far irrelevant information on the model. At the same time, the grammatical information of sentences was introduced to increase the dependency between words. After fusing the two features, Mask and attention mechanism were used to pay special attention to the aspect information and reduce the interference of useless information to the sentiment analysis model. Besides, in order to evaluate the influence degree of contextual and grammatical information on sentiment polarity, an information evaluation coefficient was proposed. Experiments were carried out on five public datasets, and the results show that compared with the sentiment analysis model AGCN-MAX (Aggregated Graph Convolutional Network-MAX), the EDNR model has the accuracy and F1 score on dataset 14Lap improved by 2.47 percentage points and 2.83 percentage points respectively. It can be seen that the EDNR model can effectively capture emotional features and improve the classification performance.

Key words: Aspect-Based Sentiment Analysis (ABSA), neighborhood representation, sentiment polarity, nearest neighbor strategy, information evaluation coefficient

中图分类号: