计算机应用 ›› 2020, Vol. 40 ›› Issue (1): 16-22.DOI: 10.11772/j.issn.1001-9081.2019060968

• 人工智能 • 上一篇    下一篇

基于BiLSTM-CNN串行混合模型的文本情感分析

赵宏, 王乐, 王伟杰   

  1. 兰州理工大学 计算机与通信学院, 兰州 730050
  • 收稿日期:2019-06-10 修回日期:2019-08-08 出版日期:2020-01-10 发布日期:2019-10-08
  • 通讯作者: 王乐
  • 作者简介:赵宏(1971-),男,甘肃西和人,教授,博士,CCF会员,主要研究方向:并行与分布式处理、自然语言处理、深度学习;王乐(1994-),女,甘肃玉门人,硕士研究生,CCF会员,主要研究方向:自然语言处理、深度学习、情感分析;王伟杰(1994-),女,黑龙江齐齐哈尔人,硕士研究生,CCF会员,主要研究方向:声纹识别、深度学习。
  • 基金资助:
    国家自然科学基金资助项目(51668043);赛尔网络下一代互联网技术创新项目(NG1120160311,NG1120160112)。

Text sentiment analysis based on serial hybrid model of bi-directional long short-term memory and convolutional neural network

ZHAO Hong, WANG Le, WANG Weijie   

  1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou Gansu 730050, China
  • Received:2019-06-10 Revised:2019-08-08 Online:2020-01-10 Published:2019-10-08
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (51668043), the CERNET Innovation Project (NGII20160311, NGII20160112).

摘要: 针对现有文本情感分析方法准确率不高、实时性不强以及特征提取不充分的问题,构建了双向长短时记忆神经网络和卷积神经网络(BiLSTM-CNN)的串行混合模型。首先,利用双向循环长短时记忆(BiLSTM)神经网络提取文本的上下文信息;然后,对已提取的上下文特征利用卷积神经网络(CNN)进行局部语义特征提取;最后,使用Softmax得出文本的情感倾向。通过与CNN、长短时记忆神经网络(LSTM)、BiLSTM等单一模型对比,所提出的文本情感分析模型在综合评价指标F1上分别提高了2.02个百分点、1.18个百分点和0.85个百分点;与长短时记忆神经网络和卷积神经网络(LSTM-CNN)、BiLSTM-CNN并行特征融合等混合模型对比,所提出的文本情感分析模型在综合评价指标F1上分别提高了1.86个百分点和0.76个百分点。实验结果表明,基于BiLSTM-CNN的串行混合模型在实际应用中具有较大的价值。

关键词: 文本情感分析, 上下文信息, 语义特征, 长短时记忆神经网络, 卷积神经网络

Abstract: Aiming at the problems of low accuracy, poor real-time performance and insufficient feature extraction in existing text sentiment analysis methods, a serial hybrid model based on Bi-directional Long Short-Term Memory neural network and Convolutional Neural Network (BiLSTM-CNN) was constructed. Firstly, the context information was extracted from the text by using Bi-directional Long Short Term Memory (BiLSTM) neural network. Then, the local semantic features were extracted from the context information by using Convolutional Neural Network (CNN). Finally, the emotional tendency of text was obtained by using Softmax. Compared with single models such as CNN, Long Short-Term Memory (LSTM) and BiLSTM, the proposed text sentiment analysis model increases the comprehensive evaluation index F1 by 2.02 percentage points, 1.18 percentage points and 0.85 percentage points respectively; and compared with the hybrid models such as LSTM and CNN (LSTM-CNN) and parallel features fusion of BiLSTM-CNN, the proposed text sentiment analysis model improves the comprehensive evaluation index F1 by 1.86 percentage points and 0.76 percentage points respectively. The experimental results show that the serial hybrid model based on BiLSTM-CNN has great value in practical applications.

Key words: text sentiment analysis, context information, semantic feature, Long Short-Term Memory (LSTM) neural network, Convolutional Neural Network (CNN)

中图分类号: