Journal of Computer Applications ›› 2020, Vol. 40 ›› Issue (9): 2543-2548. DOI: 10.11772/j.issn.1001-9081.2019112020

• Artificial intelligence •

Sentiment analysis based on parallel hybrid network and attention mechanism

SUN Min, LI Yang, ZHUANG Zhengfei, YU Dawei   

  1. School of Information and Computer, Anhui Agricultural University, Hefei, Anhui 230036, China
  • Received: 2019-11-28  Revised: 2020-01-09  Online: 2020-09-10  Published: 2020-06-29
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61402013).

  • Corresponding author: LI Yang
  • About the authors: SUN Min, born in 1994, female, from Bengbu, Anhui, is an M. S. candidate whose research interests include deep learning and sentiment analysis; LI Yang, born in 1963, male, from Bengbu, Anhui, is a professor with a Ph. D. whose research interests include deep learning, computer networks, intelligent transportation, intelligent buildings and security, and agricultural network information engineering; ZHUANG Zhengfei, born in 1996, male, from Chuzhou, Anhui, is an M. S. candidate whose research interests include machine learning and text mining; YU Dawei, born in 1994, male, from Anqing, Anhui, is an M. S. candidate whose research interests include text classification and attention mechanisms.

Abstract: The traditional Convolutional Neural Network (CNN) ignores the contextual semantic information of words and loses a large amount of feature information during max pooling, the traditional Recurrent Neural Network (RNN) suffers from memory loss and vanishing gradients, and both CNN and RNN ignore how important individual words are to the meaning of a sentence. To address these problems, a model based on a parallel hybrid network and an attention mechanism was proposed. First, the text was vectorized with GloVe. Then, through the embedding layer, a CNN and a Bidirectional Gated Recurrent Unit (BGRU) network were used in parallel to extract text features with different characteristics. Next, the features extracted by the two networks were fused, and an attention mechanism was introduced to judge the importance of different words to the meaning of the sentence. Multiple sets of comparative experiments were performed on the IMDB English corpus. The experimental results show that the accuracy of the proposed model in text classification reaches 91.46% and its F1-Measure reaches 91.36%.
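The pipeline described above (GloVe embeddings feeding a CNN branch and a BGRU branch in parallel, feature fusion, then attention before classification) can be sketched roughly as follows. This is a minimal illustration assuming tensorflow.keras; the layer sizes, filter width, and the simple additive attention formulation are assumptions for illustration, not the hyperparameters or the exact attention used in the paper.

```python
# Sketch of a parallel CNN + BGRU network with attention (not the authors' code).
# Vocabulary size, sequence length, and layer widths below are assumed values.
import tensorflow as tf
from tensorflow.keras import layers, Model

vocab_size, seq_len, embed_dim = 20000, 300, 100   # assumed sizes

inputs = layers.Input(shape=(seq_len,))
# Embedding layer; in the paper the word vectors come from pre-trained GloVe.
emb = layers.Embedding(vocab_size, embed_dim)(inputs)

# Branch 1: CNN extracts local n-gram features; the full sequence is kept
# (no max pooling), since the paper criticizes max pooling for losing features.
conv = layers.Conv1D(128, 3, padding="same", activation="relu")(emb)

# Branch 2: bidirectional GRU captures context in both directions.
bgru = layers.Bidirectional(layers.GRU(64, return_sequences=True))(emb)

# Feature fusion: concatenate the two feature maps along the channel axis.
fused = layers.Concatenate()([conv, bgru])           # (batch, seq_len, 256)

# Simple additive attention over time steps to weight important words.
score = layers.Dense(1, activation="tanh")(fused)    # (batch, seq_len, 1)
alpha = layers.Softmax(axis=1)(score)                # per-word attention weights
context = layers.Lambda(lambda t: tf.reduce_sum(t[0] * t[1], axis=1))([fused, alpha])

outputs = layers.Dense(1, activation="sigmoid")(context)  # binary sentiment
model = Model(inputs, outputs)
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```

With this layout, both branches see the same embedded sequence, and the attention weights are computed over the fused representation, so each word's contribution to the sentence-level sentiment is learned jointly from the CNN and BGRU features.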

Key words: Convolutional Neural Network (CNN), Bidirectional Gated Recurrent Unit (BGRU), feature fusion, attention mechanism, text sentiment analysis

