Sentiment analysis based on sentiment lexicon and stacked residual Bi-LSTM network

doi:10.11772/j.issn.1001-9081.2021071179

Abstract:

Sentiment analysis, a subfield of Natural Language Processing (NLP), has progressed from sentiment-lexicon methods through machine learning to deep learning. A general-purpose deep learning model used as a text classifier for Web reviews in a specific domain suffers from low accuracy and from overfitting during training, while a sentiment lexicon suffers from low coverage and a heavy compilation workload. To address these problems, a sentiment analysis model based on a sentiment lexicon and a stacked residual Bidirectional Long Short-Term Memory (Bi-LSTM) network was proposed. Firstly, the sentiment words in the lexicon were designed to cover the specialized vocabulary of the "educational robot" domain, compensating for the limited accuracy of the Bi-LSTM model on such texts. Then, Bi-LSTM and SnowNLP were used to reduce the size of the lexicon that had to be compiled. The memory gate and forget gate of the Long Short-Term Memory (LSTM) network allowed the dependencies between preceding and following words in a review to be fully considered while words already analyzed were selectively forgotten, thereby avoiding gradient explosion during back propagation. With the stacked residual Bi-LSTM, the model was deepened to 8 layers, and the residual connections avoided the "degradation" problem that stacking LSTM layers would otherwise cause. Finally, the score weights of the two parts were set and adjusted appropriately, and the sigmoid activation function was used to normalize the total score to the interval [0, 1]. Scores in [0, 0.5] and (0.5, 1] represented negative and positive sentiment respectively, which completed the sentiment classification.
Experimental results show that, on the "educational robot" review dataset, the sentiment classification accuracy of the proposed model is about 4.5 percentage points higher than that of the standard LSTM model and about 2.0 percentage points higher than that of BERT (Bidirectional Encoder Representations from Transformers). In conclusion, the proposed model generalizes the sentiment classification approach that combines a sentiment lexicon with a deep learning model. By modifying the sentiment words in the lexicon and appropriately adjusting the structure and number of layers of the deep learning model, it can be applied to accurate sentiment analysis of shopping reviews for all kinds of goods on e-commerce platforms, helping enterprises understand consumers' shopping psychology and market demand and giving consumers a reference for product quality.

Key words: Bidirectional Long Short-Term Memory (Bi-LSTM) network, shopping review, sentiment analysis, stacked residual, sentiment lexicon

CLC Number: TP391.1

Haoran LUO, Qing YANG. Sentiment analysis based on sentiment lexicon and stacked residual Bi-LSTM network[J]. Journal of Computer Applications, 2022, 42(4): 1099-1107.

Figures/Tables 14

Dimension	Sentiment polarity	Sentiment words
Learning	Positive	丰富、充足、充裕、富饶、丰硕、足够、复杂、科学、寓教于乐、教育、教诲、教导、培养、培育、造就、教训、训导、训诲、指导、训诫、教化、教学、教授、熏陶、教养、进步、前进、提高、上进、发展、向前、先进、优秀、领先、成长、兴趣、爱好、兴致、兴会、兴致、风趣、兴味、乐趣、意思、趣味、有趣
Learning	Negative	缺乏、不足、枯竭、欠缺、单调、短缺、贫乏、缺少、匮乏、退步、失败、失利、倒退、无趣、乏味、枯燥、无聊、重复、机械、僵硬、过时、落伍、落后、没用、没价值、玩具、应付、凑合、没意义
Audio-visual	Positive	亮眼、落落大方、清晰、舒服、亮、清晰、高分辨率、高刷屏、窄、单手、显露、清晰、了解、清爽、分明、显现、明晰、明确、明了、真切、鲜艳、畅通、贯通、通畅、畅达、优秀、流畅、大喇叭、一清二楚
Audio-visual	Negative	模糊、暗淡、暗、不清楚、糊、低、细、纤细、微弱、弱、薄、灰暗、黯淡、费眼、费劲、累、麻烦、看不清、卡、卡顿、暂停、波动、闪退、出错
Interaction	Positive	VR、智能、聪明、智慧、科学、有才、现代化、伶俐、灵巧、机警、聪敏、机智、机灵、聪慧、讨喜、喜欢、心爱、宠爱、笃爱、嗜好、喜好、热爱、爱好、喜爱、可爱、人缘、厉害、能干、复杂、人性化、亲切、活灵活现、仿真、AI、善解人意、懂、明白、一清二楚、游刃有余、逗乐、搞笑、欢乐、开玩笑、有意思、有趣
Interaction	Negative	愚蠢、笨、愚笨、愚昧、笨拙、拙笨、蒙昧、无知、迂曲、痴呆、呆笨、鲁钝、无能、蠢货、蠢、反人类、弱智、低能、低下、无聊、傻乎乎、傻、乏味、玩具、智障、人工智障、弱、毫无用处、无用、不懂、不明白、答非所问
User experience	Positive	美丽、优雅、高贵、大、轻薄、漂亮、好看、颜值高、耐看、讲究、贵气、个性、创意感、设计感、文艺、怀旧、年轻、复古、写意、用心、文化、底蕴、亲切感、靓、靓丽、合适、舒服、上手、小巧、亮眼、帅气、精神、一流、顺利、亲切、到位、得体、爱心、和蔼、尊敬、恭敬、彬彬有礼、称呼、礼貌、顺畅、无压力、神速、急速、迅疾、快捷、敏捷、飞快、疾速、急剧、迅速、赶紧、赶快、飞速、火速、好说话
User experience	Negative	丑、丑陋、笨重、厚、单调、寒酸、老套、俗气、土、土气、马虎、粗糙、粗制滥造、伪劣、下等、次、垃圾、不行、老气、低档次、没档次、不敢恭维、等待、漫长、心累、无力、吐槽、消极、冷漠、冷落、疏远、冷酷、淡漠、漠视、冷淡、冷傲、忽视、傲慢、倨傲、狂妄、高傲、无礼、高慢、缓慢、迟缓、舒缓、怠缓、慢慢、舒徐、平缓
Price	Positive	实在、优惠、物超所值、亲民、厚道、廉价、对得起、合适、适中、可以的、到位、低廉、便宜、童叟无欺、公道、经济、高性价比、美丽、透明、完美、通透、接地气
Price	Negative	贵、高昂、夸张、坑人、坑爹、不当、不值、涨价、亏了、亏本、不赚昂贵、不菲、离谱、吓人、要命
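A lexicon organized by dimension and polarity, as in the table above, can be turned into a raw per-dimension score. The sketch below uses a few words taken from the Price (价格) and Learning (学习) rows; the substring-counting rule and the "positive hits minus negative hits" score are assumptions made for illustration, not the scoring rule of the paper, and `LEXICON` and `lexicon_score` are hypothetical names.

```python
# A small excerpt of the lexicon; the full table has many more words per cell.
LEXICON = {
    "price": {
        "positive": ["便宜", "优惠", "物超所值"],
        "negative": ["贵", "坑人", "离谱"],
    },
    "learning": {
        "positive": ["丰富", "有趣", "寓教于乐"],
        "negative": ["枯燥", "无聊", "乏味"],
    },
}


def lexicon_score(text: str, dimension: str) -> int:
    """Count positive hits minus negative hits for one dimension.

    Matching is by plain substring occurrence (assumption); a real system
    would more likely segment the review into words first.
    """
    entry = LEXICON[dimension]
    positive_hits = sum(text.count(word) for word in entry["positive"])
    negative_hits = sum(text.count(word) for word in entry["negative"])
    return positive_hits - negative_hits


if __name__ == "__main__":
    print(lexicon_score("价格便宜，优惠很大", "price"))
    print(lexicon_score("太贵了，简直坑人", "price"))
```

Substring matching has an obvious weakness here: a single-character entry such as 贵 also matches inside 高贵, which the table lists as a positive User experience word, so word segmentation before lookup would be the safer choice.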

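First-level factor	First-level score (Tmall/iFLYTEK/Xiaodu/狄刺史)	First-level polarity	Second-level factor	Second-level score (Tmall/iFLYTEK/Xiaodu/狄刺史)	Second-level polarity
Learning	0.41/0.76/0.47/0.79	neg/pos/neg/pos	Teaching	0.19/0.90/0.32/0.83	neg/pos/neg/pos
			Exercises	0.43/0.65/0.46/0.79	neg/pos/neg/pos
			Questions	0.44/0.64/0.63/0.75	neg/pos/pos/pos
			Question search	0.57/0.83/0.45/0.78	pos/pos/neg/pos
Audio-visual	0.79/0.56/0.79/0.73	pos/pos/pos/pos	Video	0.80/0.20/0.75/0.70	pos/neg/pos/pos
			Sound quality	0.78/0.73/0.79/0.79	pos/pos/pos/pos
			Entertainment	0.79/0.75/0.84/0.69	pos/pos/pos/pos
Interaction	0.58/0.92/0.68/0.49	pos/pos/pos/neg	Voice	0.43/0.97/0.48/0.31	neg/pos/neg/neg
			Dialogue	0.37/0.95/0.58/0.30	neg/pos/pos/neg
			Chat	0.53/0.88/0.64/0.43	pos/pos/pos/neg
			Answering	0.68/0.93/0.85/0.61	pos/pos/pos/pos
			Companionship	0.88/0.89/0.83/0.84	pos/pos/pos/pos
User experience	0.73/0.60/0.80/0.59	pos/pos/pos/pos	Color	0.69/0.65/0.76/0.73	pos/pos/pos/pos
			Hand feel	0.74/0.51/0.80/0.52	pos/pos/pos/pos
			Size	0.80/0.47/0.83/0.48	pos/neg/pos/neg
			Appearance	0.69/0.76/0.80/0.63	pos/pos/pos/pos
Price	0.75/0.55/0.81/0.70	pos/pos/pos/pos	Cheapness	0.75/0.34/0.82/0.68	pos/neg/pos/pos
			Cost-performance ratio	0.75/0.76/0.80/0.72	pos/pos/pos/pos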
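For the Learning (学习) rows of the table above, the reported first-level scores coincide with the rounded arithmetic mean of the four second-level scores, and each polarity follows the 0.5 split. The source does not state how first-level scores are aggregated, so the averaging rule below is an observation about these rows rather than a confirmed method; the English factor names and the helper names are illustrative.

```python
from decimal import Decimal, ROUND_HALF_UP

BRANDS = ["Tmall", "iFLYTEK", "Xiaodu", "狄刺史"]

# Second-level scores of the Learning dimension, one value per brand,
# kept as strings so Decimal arithmetic stays exact.
LEARNING = {
    "Teaching": ["0.19", "0.90", "0.32", "0.83"],
    "Exercises": ["0.43", "0.65", "0.46", "0.79"],
    "Questions": ["0.44", "0.64", "0.63", "0.75"],
    "Question search": ["0.57", "0.83", "0.45", "0.78"],
}


def mean_scores(rows: dict) -> list:
    """Per-brand mean of the second-level scores, rounded to 2 decimals."""
    means = []
    for column in zip(*rows.values()):
        values = [Decimal(v) for v in column]
        mean = sum(values) / len(values)
        means.append(mean.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP))
    return means


def polarity(score: Decimal) -> str:
    """[0, 0.5] is negative and (0.5, 1] is positive."""
    return "positive" if score > Decimal("0.5") else "negative"


if __name__ == "__main__":
    for brand, score in zip(BRANDS, mean_scores(LEARNING)):
        print(brand, score, polarity(score))
```

`Decimal` with `ROUND_HALF_UP` is used instead of floats because several of these means end exactly in a 5 at the third decimal, where binary floating-point rounding can go either way.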

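First-level factor	Second-level factor	Positive count	Negative count	Positive accuracy	Negative accuracy
Learning	Teaching	2 175	1 664	0.88	0.87
	Exercises	1 997	987	0.89	0.88
	Questions	2 017	753	0.86	0.87
	Question search	2 389	593	0.87	0.87
Audio-visual	Video	3 051	1 368	0.90	0.85
	Sound quality	1 870	912	0.89	0.88
	Entertainment	4 076	1 593	0.89	0.90
Interaction	Voice	1 032	1 238	0.85	0.89
	Dialogue	1 980	2 991	0.86	0.83
	Chat	2 016	3 757	0.84	0.85
	Answering	988	2 552	0.85	0.84
	Companionship	3 560	680	0.89	0.88
User experience	Color	1 332	498	0.88	0.87
	Hand feel	857	216	0.87	0.86
	Size	685	503	0.89	0.88
	Appearance	1 999	622	0.91	0.89
Price	Cheapness	4 057	2 315	0.92	0.89
	Cost-performance ratio	1 926	870	0.90	0.88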
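The per-class counts and accuracies in the table above can be combined into one count-weighted accuracy per dimension. The sketch below does this for the two Price (价格) rows. It assumes that each count is the number of reviews of that polarity that were evaluated and that each accuracy applies to exactly those reviews; the table does not state this. Because the accuracies are rounded to two decimals, the result is only approximate, and `weighted_accuracy` is a hypothetical helper name.

```python
# (positive count, negative count, positive accuracy, negative accuracy)
PRICE_ROWS = {
    "Cheapness": (4057, 2315, 0.92, 0.89),
    "Cost-performance ratio": (1926, 870, 0.90, 0.88),
}


def weighted_accuracy(rows: dict) -> float:
    """Count-weighted average of the per-class accuracies.

    Assumes count * accuracy approximates the number of correctly
    classified reviews in each class (assumption, see lead-in).
    """
    correct = 0.0
    total = 0
    for pos_n, neg_n, pos_acc, neg_acc in rows.values():
        correct += pos_n * pos_acc + neg_n * neg_acc
        total += pos_n + neg_n
    return correct / total


if __name__ == "__main__":
    print(f"Price: {weighted_accuracy(PRICE_ROWS):.3f}")
```

Weighting by count matters because the classes are unbalanced: in several rows one polarity has two to five times as many reviews as the other, so an unweighted mean of the two accuracies would over-represent the smaller class.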