Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (7): 1931-1935.DOI: 10.11772/j.issn.1001-9081.2018112375

• Artificial intelligence • Previous Articles     Next Articles

Sentiment analysis method combining sentiment and semantic information

MENG Shilin1, ZHAO Yunlong1,2, GUAN Donghai1,2, ZHAI Xiangping1,2   

  1. 1. School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing Jiangsu 211106, China;
    2. Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing Jiangsu 210023, China
  • Received:2018-12-03 Revised:2019-02-17 Online:2019-07-15 Published:2019-07-10
  • Supported by:

    This work is partially supported by the National Defense Basic Scientific Research Program (JCKY2016605B006), the Six Talent Peaks Project of Jiangsu Province (XYDXXJS-031), the Fundamental Research Funds for the Central Universities (90YAH16042).

融合情感与语义信息的情感分析方法

孟仕林1, 赵蕴龙1,2, 关东海1,2, 翟象平1,2   

  1. 1. 南京航空航天大学 计算机科学与技术学院, 南京 211106;
    2. 软件新技术与产业化协同创新中心, 南京 210023
  • 通讯作者: 赵蕴龙
  • 作者简介:孟仕林(1994-),男,安徽界首人,硕士研究生,CCF会员,主要研究方向:数据挖掘、情感分析;赵蕴龙(1975-),男,黑龙江哈尔滨人,教授,博士,CCF会员,主要研究方向:无线网络、群体计算、说话人识别、数据挖掘、可穿戴计算技术;关东海(1981-),男,黑龙江哈尔滨人,副教授,博士,CCF会员,主要研究方向:语音识别、数据挖掘、人工智能;翟象平(1984-),男,山东淄博人,副教授,博士,CCF会员,主要研究方向:普适计算、蜂群无人系统、群体智能、数据挖掘。
  • 基金资助:

    国防基础科研计划资助项目(JCKY2016605B006);江苏省"六大人才高峰"高层次人才项目(XYDXXJS-031);中央高校基本科研业务费专项(90YAH16042)。

Abstract:

When using word embedding method for word-to-vector, two antonyms are converted into similar vectors. If they are sentiment words, it will lead to the loss of sentimental information, which is unreasonable in sentiment analysis task. To solve this problem, a method of adding sentiment vectors to obtain sentiment information based on word embedding was proposed. Firstly, the sentiment vector was constructed by using sentiment lexicon, and combined with word vector obtained by word embedding method. Then, a bidirectional Long Short Term Memory (BiLSTM) network was used to obtain the characteristics of text. Finally, the sentiment of text was classified. Experiments of the proposed method and the method without fusing sentimental vector were carried out on four datasets. The experimental results show that the classification accuracy and F1 score of the proposed method are higher than those of the method without fusion, which indicates that adding sentimental vectors is beneficial to improve the performance of sentiment analysis.

Key words: sentiment analysis, word embedding, sentiment word, sentiment information, bidirectional LSTM (BiLSTM)

摘要:

在使用词嵌入法进行词转向量时,两个反义词会转换成相近的向量。如果这两个词是情感词,将会导致词的情感信息的丢失,这在情感分析任务中是不合理的。为了解决这个问题,提出了一种在词嵌入的基础上增加情感向量来获取情感信息的方法。首先利用情感词典资源构建情感向量,将其与词嵌入法得到的词向量融合在一起;然后采用双向长短期记忆(BiLSTM)网络获取文本的特征;最后对文本的情感进行分类。在4个数据集上分别对该方法与未融合情感向量的方法进行了实验。实验结果表明所提方法分类准确度与F1值都高于未融合方法,说明了加入情感向量有助于提高情感分析的性能。

关键词: 情感分析, 词嵌入, 情感词, 情感信息, 双向长短期记忆网络

CLC Number: