基于门控循环单元和胶囊特征的文本情感分析

doi:10.11772/j.issn.1001-9081.2020010128

计算机应用 ›› 2020, Vol. 40 ›› Issue (9): 2531-2535.DOI: 10.11772/j.issn.1001-9081.2020010128

基于门控循环单元和胶囊特征的文本情感分析

杨云龙, 孙建强, 宋国超

山东科技大学计算机科学与工程学院, 山东青岛 266590

收稿日期:2020-02-12 修回日期:2020-03-28 发布日期:2020-03-31 出版日期:2020-09-10
通讯作者: 杨云龙
作者简介:杨云龙(1995-),男,山东潍坊人,硕士研究生,CCF会员,主要研究方向:自然语言处理、情感分析、文本分类;孙建强(1996-),男,山东德州人,硕士研究生,主要研究方向:人工智能、知识图谱;宋国超(1994-),女,山东烟台人,硕士研究生,主要研究方向:位置隐私保护、轨迹数据发布。
基金资助:
山东科技大学研究生创新项目（SDKDYC190225）。

Text sentiment analysis based on gated recurrent unit and capsule features

YANG Yunlong, SUN Jianqiang, SONG Guochao

College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao Shandong 266590, China

Received:2020-02-12 Revised:2020-03-28 Online:2020-03-31 Published:2020-09-10
Supported by:
This work is partially supported by the Graduate Innovation Funds of Shandong University of Science and Technology (SDKDYC190225).

摘要/Abstract

摘要： 针对简单的循环神经网络（RNN）无法长时间记忆信息和单一的卷积神经网络（CNN）缺乏捕获文本上下文语义的能力的问题，为提升文本分类的准确率，提出一种门控循环单元（GRU）和胶囊特征融合的情感分析模型G-Caps。首先通过GRU捕捉文本的上下文全局特征，获得整体标量信息；其次在初始胶囊层将捕获的信息通过动态路由算法进行迭代，获取到表示文本整体属性的向量化的特征信息；最后在主胶囊部分进行特征间的组合以求获得更准确的文本属性，并根据各个特征的强度大小分析文本的情感极性。在基准数据集MR上进行的实验的结果表明，与初始卷积滤波器的CNN（CNN+INI）和批判学习的CNN（CL_CNN）方法相比，G-Caps的分类准确率分别提升了3.1个百分点和0.5个百分点。由此可见，G-Caps模型有效地提高了实际应用中文本情感分析的准确性。

关键词: 情感分析, 权重共享, 胶囊模型, 门控循环单元动态路由, 文本属性

Abstract: Aiming at the problems that simple Recurrent Neural Network (RNN) cannot memorize information for a long time and single Convolutional Neural Network (CNN) lacks the ability to capture the semantics of text context, in order to improve the accuracy of text classification, a sentiment analysis model G-Caps (Gated Recurrent Unit-Capsule) was proposed, which combines Gated Recurrent Unit (GRU) and capsule features. First, the contextual global features of the text were captured through GRU in order to obtain the global scalar information. Second, the captured information was iterated through the dynamic routing algorithm at the initial capsule layer to obtain the vectorized feature information representing the overall attributes of the text. Finally, the features were combined in the main capsule part to obtain more accurate text attributes, and the sentiment polarity of the text was analyzed according to the intensity of each feature. Experimental results on the benchmark dataset MR (Movie Reviews) showed that compared with the CNN + INI (Convolutional Neural Network + Initializing convolutional filters) and CL_CNN (Critic Learning_Convolutional Neural Network) methods, G-Caps had the classification accuracy increased by 3.1 percentage points and 0.5 percentage points respectively. It can be seen that the G-Caps model effectively improves the accuracy of text sentiment analysis in practice.

Key words: sentiment analysis, weight sharing, capsule model, Gated Recurrent Unit (GRU) dynamic routing, text attribute

中图分类号:

杨云龙, 孙建强, 宋国超. 基于门控循环单元和胶囊特征的文本情感分析[J]. 计算机应用, 2020, 40(9): 2531-2535.

YANG Yunlong, SUN Jianqiang, SONG Guochao. Text sentiment analysis based on gated recurrent unit and capsule features[J]. Journal of Computer Applications, 2020, 40(9): 2531-2535.

参考文献

[1] SOCHER R, PENNINGTON J, HUANG E H, et al. Semisupervised recursive autoencoders for predicting sentiment distributions[C]//Proceedings of the 2011 Conference on empirical methods in Natural Language Processing. Stroudsburg, PA:Association for Computational Linguistics,2011:151-161.
[2] SOCHER R,PERELYGIN A,WU J,et al. Recursive deep models for semantic compositionality over a sentiment treebank[C]//Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Stroudsburg,PA:Association for Computational Linguistics,2013:1631-1642.
[3] MIKOLOV T. Statistical language models based on neural networks[D]. Brno:Brno University of Technology,2012:47-61.
[4] TAI K S, SOCHER R, MANNING C D. Improved semantic representations from tree-structured long short-term memory networks[EB/OL].[2019-09-07]. https://arxiv.org/pdf/1503.00075.pdf.
[5] KALCHBRENNER N, GREFENSTETTE E, BLUNSOM P. A convolutional neural network for modelling sentences[EB/OL].[2019-09-07]. https://arxiv.org/pdf/1404.2188.pdf.
[6] KIM Y. Convolutional neural networks for sentence classification[EB/OL].[2019-09-13]. https://arxiv.org/pdf/1408.5882.pdf.
[7] 刘书齐, 王以松, 陈攀峰. 基于CNN-ATTBiLSTM的文本情感分析[J]. 贵州大学学报(自然科学版),2019,36(2):85-89.(LIU S Q,WANG Y S,CHEN P F. A CNN-ATTBiLSTM model neural network for text sentiment analysis[J]. Journal of Guizhou University(Natural Sciences),2019,36(2):85-89.)
[8] 王丽亚, 刘昌辉, 蔡敦波, 等. 基于CNN-BiLSTM网络引入注意力模型的文本情感分析[J]. 武汉工程大学学报,2019,41(4):386-391.(WANG L Y,LIU C H,CAI D B,et al. Text sentiment analysis based on CNN-BiLSTM network and attention model[J]. Journal of Wuhan Institute of Technology,2019,41(4):386-391.)
[9] ZHANG B,XU X,LI X,et al. Sentiment analysis through critic learning for optimizing convolutional neural networks with rules[J]. Neurocomputing,2019,356:21-30.
[10] YIN W,SCHÜTZE H,XIANG B,et al. ABCNN:attention-based convolutional neural network for modeling sentence pairs[J]. Transactions of the Association for Computational Linguistics, 2016,4:259-272.
[11] ZHOU P,SHI W,TIAN J,et al. Attention-based bidirectional long short-term memory networks for relation classification[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics,2016:207-212.
[12] 杨开漠, 吴明芬, 陈涛. 广义文本情感分析综述[J]. 计算机应用,2019,39(S2):6-14.(YANG K M,WU M F,CHEN T. Generalized text sentiment analysis review[J]. Journal of Computer Applications,2019,39(S2):6-14.)
[13] HOCHREITER S,SCHMIDHUBER J. Long short-term memory[J]. Neural Computation,1997,9(8):1735-1780.
[14] 蔡国永, 林强, 任凯琪. 基于域对抗网络和BERT的跨领域文本情感分析[J]. 山东大学学报(工学版),2020,50(1):1-7,20. (CAI G Y,LIN Q,REN K Q. Cross-domain text sentiment classification based on domain-adversarial network and BERT[J]. Journal of Shandong University(Engineering Science),2020,50(1):1-7,20.)
[15] KIRITCHENKO S,ZHU X,CHERRY C,et al. NRC-Canada-2014:detecting aspects and sentiment in customer reviews[C]//Proceedings of the 8th International Workshop on Semantic Evaluation. Stroudsburg, PA:Association for Computational Linguistics,2014:437-442.
[16] 林江豪, 顾也力, 周咏梅, 等. 基于表情符号的情感词典的构建研究[J]. 计算机技术与发展,2019,29(6):181-185.(LIN J H,GU Y L,ZHOU Y M,et al. Research on building sentiment lexicon based on emoticons[J]. Computer Technology and Development,2019,29(6):181-185.)
[17] QIAN Q,HUANG M,LEI J,et al. Linguistically regularized LSTM for sentiment classification[EB/OL].[2019-08-18]. https://arxiv.org/pdf/1611.03949.pdf.
[18] VO D T,ZHANG Y. Don't count,predict! an automatic approach to learning sentiment lexicons for short text[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics,2016:219-224.
[19] WANG Y,HUANG M,ZHU X,et al. Attention-based LSTM for aspect-level sentiment classification[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA:Association for Computational Linguistics, 2016:606-615.
[20] WANG Y,SUN A,HAN J,et al. Sentiment analysis by capsules[C]//Proceedings of the 2018 World Wide Web Conference. Republic and Canton of Geneva,CHE:International World Wide Web Conferences Steering Committee,2018:1165-1174.
[21] SABOUR S,FROSST N,HINTON G E. Dynamic routing between capsules[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook,NY:Curran Associates Inc.,2017:3856-3866.
[22] PANG B,LEE L. Movie review data[EB/OL].[2019-10-08]. https://www.cs.cornell.edu/people/pabo/movie-review-data/.
[23] WANG J,YU L C,LAI K R,et al. Dimensional sentiment analysis using a regional CNN-LSTM model[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics,2016:225-230.
[24] CHEN T,XU R,HE Y,et al. Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN[J]. Expert Systems with Applications,2017,72:221-230.
[25] HU Z,MA X,LIU Z,et al. Harnessing deep neural networks with logic rules[EB/OL].[2019-09-25]. https://arxiv.org/pdf/1603.06318.pdf.
[26] LI S,ZHAO Z,LIU T,et al. Initializing convolutional filters with semantic features for text classification[C]//Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA:Association for Computational Linguistics,2017:1884-1889.

基于门控循环单元和胶囊特征的文本情感分析

Text sentiment analysis based on gated recurrent unit and capsule features

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	薛凯鹏, 徐涛, 廖春节. 融合自监督和多层交叉注意力的多模态情感分析网络[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2387-2392.
[2]	柯添赐, 刘建华, 孙水华, 郑智雄, 蔡子杰. 融合强关联依赖和简洁语法的方面级情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1786-1795.
[3]	高龙涛, 李娜娜. 基于方面感知注意力增强的方面情感三元组抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1049-1057.
[4]	杨先凤, 汤依磊, 李自强. 基于交替注意力机制和图卷积网络的方面级情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1058-1064.
[5]	郭磊, 贾真, 李天瑞. 面向方面级情感分析的交互式关系图注意力网络[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 696-701.
[6]	李言博, 何庆, 陆顺意. 融合语义和句法信息的方面情感三元组抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3275-3280.
[7]	罗俊豪, 朱焱. 用于未对齐多模态语言序列情感分析的多交互感知网络[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 79-85.
[8]	陈丽安, 过弋. 融合个体偏差信息的文本情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 145-151.
[9]	张心月, 刘蓉, 魏驰宇, 方可. 融合提示知识的方面级情感分析方法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2753-2759.
[10]	衡红军, 杨鼎诚. 知识增强的方面词交互图神经网络[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2412-2419.
[11]	何嘉明, 杨巨成, 吴超, 闫潇宁, 许能华. 基于多模态图卷积神经网络的行人重识别方法[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2182-2189.
[12]	郑智雄, 刘建华, 孙水华, 徐戈, 林鸿辉. 融合多窗口局部信息的方面级情感分析模型[J]. 《计算机应用》唯一官方网站, 2023, 43(6): 1796-1802.
[13]	方澄, 李贝, 韩萍, 吴琼. 基于语法依存图的中文微博细粒度情感分类[J]. 《计算机应用》唯一官方网站, 2023, 43(4): 1056-1061.
[14]	徐丹, 龚红仿, 罗容容. 具有方面项和上下文表示的方面情感分析[J]. 《计算机应用》唯一官方网站, 2023, 43(10): 3086-3092.
[15]	刘欢, 窦全胜. 嵌入不同邻域表征的方面级情感分析模型[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 37-44.