面向短文本情感分类的端到端对抗变分贝叶斯方法

doi:10.11772/j.issn.1001-9081.2020010048

计算机应用 ›› 2020, Vol. 40 ›› Issue (9): 2536-2542.DOI: 10.11772/j.issn.1001-9081.2020010048

面向短文本情感分类的端到端对抗变分贝叶斯方法

尹春勇, 章荪

南京信息工程大学计算机与软件学院, 南京 210044

收稿日期:2020-01-17 修回日期:2020-04-24 发布日期:2020-05-06 出版日期:2020-09-10
通讯作者: 尹春勇
作者简介:尹春勇(1977-),男,山东潍坊人,教授,博士生导师,博士,主要研究方向:网络空间安全、大数据挖掘、隐私保护、人工智能、新型计算;章荪(1994-),男,安徽六安人,博士研究生,主要研究方向:机器学习、数据挖掘、文本分类。
基金资助:
国家自然科学基金资助项目（61772282）。

End-to-end adversarial variational Bayes method for short text sentiment classification

YIN Chunyong, ZHANG Sun

School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing Jiangsu 210044, China

Received:2020-01-17 Revised:2020-04-24 Online:2020-05-06 Published:2020-09-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61772282).

摘要/Abstract

摘要： 针对文本情感分析中文本过短而导致的分类准确度低的问题，结合对抗学习和变分推断提出一种端到端的短文本情感分类模型。首先，使用谱规范化技术解决了判别器在训练过程中的震荡问题；然后，添加额外的分类模型来指导推断模型的更新；其次，使用对抗变分贝叶斯（AVB）模型提取短文本的主题特征；最后，使用三次注意力机制来融合主题特征与预训练词向量特征进行分类。通过在一个产品评论和两个微博数据集上的实验结果证明，所提模型较基于自注意力的双向长短期记忆网络（BiLSTM-SA）在分类准确度上分别提高了2.9、2.2和8.4个百分点。由此可见，该模型适用于挖掘社交短文本中的情感和观点信息，对舆情发现、用户反馈、质量监督和其他相关领域具有重要的意义。

关键词: 对抗学习, 情感分类, 短文本, 变分推断, 主题模型

Abstract: Concerning the problem of low accuracy in sentiment classification caused by short text, an end-to-end short text sentiment classifier was proposed based on adversarial learning and variational inference. First, the spectrum normalization technology was employed to alleviate the vibration of discriminator in training process. Second, an additional classifier was utilized to guide the updating of the inference model. Third, the Adversarial Variational Bayes (AVB) was used to extract the topic features of the short text. Finally, topic features and pre-trained word vector features were fused by three times of attention mechanism in order to realize the classification. Experimental results on one product review and two micro-blog datasets show that the proposed model improves the accuracy by 2.9, 2.2 and 8.4 percentage points respectively compared to the Bidirectional Long Short-Term Memory network based on Self-Attention (BiLSTM-SA). It can be seen that the proposed model can be applied to mine sentiments and opinions in social short texts, which is significant for public opinion discovery, user feedback, quality supervision and other related fields.

Key words: adversarial learning, sentiment classification, short text, variational inference, topic model

中图分类号:

TP391.1

尹春勇, 章荪. 面向短文本情感分类的端到端对抗变分贝叶斯方法[J]. 计算机应用, 2020, 40(9): 2536-2542.

YIN Chunyong, ZHANG Sun. End-to-end adversarial variational Bayes method for short text sentiment classification[J]. Journal of Computer Applications, 2020, 40(9): 2536-2542.

参考文献

[1] 刘德喜, 聂建云, 万常选, 等. 基于分类的微博新情感词抽取方法和特征分析[J]. 计算机学报,2018, 41(7):1574-1597.(LIU D X,NIE J Y,WANG C X,et al. A classification based sentiment words extracting method from microblogs and its feature engineering[J]. Chinese Journal of Computers,2018,41(7):1574-1597.)
[2] GIACHANOU A,CRESTANI F. Like it or not:a survey of twitter sentiment analysis methods[J]. ACM Computing Surveys,2016, 49(2):No. 28.
[3] YADOLLAAHI A,SHAHRAKI A G,ZAIANE O R. Current state of text sentiment analysis from opinion to emotion mining[J]. ACM Computing Surveys,2017,50(2):No. 25.
[4] 曾义夫, 蓝天, 吴祖峰, 等. 基于双记忆注意力的方面级别情感分类模型[J]. 计算机学报,2019, 42(8):1845-1857.(ZENG Y F,LAN T,WU Z F,et al. Bi-memory based attention model for aspect level sentiment classification[J]. Chinese Journal of Computers,2019,42(8):1845-1857.)
[5] 许银洁, 孙春华, 刘业政. 考虑用户特征的主题情感联合模型[J]. 计算机应用,2018,38(5):1261-1266.(XU Y J,SUN C H, LIU Y Z. Joint sentiment/topic model integrating user characteristics[J]. Journal of Computer Applications,2018,38(5):1261-1266.)
[6] PASSALIS N, TEFAS A. Learning bag-of-embedded-words representations for textual information retrieval[J]. Pattern Recognition,2018,81:254-267.
[7] DEVLIN J,CHANG M W,LEE K,et al. BERT:pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. Stroudsburg, PA:Association for Computational Linguistics,2019:4171-4186.
[8] WANG J,WANG Z,ZHANG D,et al. Combining knowledge with deep convolutional neural networks for short text classification[C]//Proceedings of the 26th International Joint Conference on Artificial Intelligence. Palo Alto,CA:AAAI,2017:2915-2921.
[9] JIA C,CARSIN M B,WANG X,et al. Concept decompositions for short text clustering by identifying word communities[J]. Pattern Recognition,2018,76:691-703.
[10] CHEN J,HU Y,LIU J,et al. Deep short text classification with knowledge powered attention[C]//Proceedings of the 31st AAAI Conference on Artificial Intelligence. Palo Alto,CA:AAAI, 2019:6252-6259.
[11] KINGMA D P,WELLING M. Auto-encoding variational Bayes[EB/OL].[2019-12-20]. https://arxiv.org/pdf/1312.6114.pdf.
[12] MIAO Y,YU L,BLUNSOM P. Neural variational inference for text processing[C]//Proceedings of the 33rd International Conference on Machine Learning. New York:JMLR. org,2016:1727-1736.
[13] SRIVASTAVA A,SUTTON C. Autoencoding variational inference for topic models[EB/OL].[2019-03-04]. https://arxiv.org/pdf/1703.01488.pdf.
[14] GOODFELLOW I J,POUGET-ABADIE J,MIRZA M,et al. Generative adversarial nets[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems. Cambridge:MIT Press,2014:2672-2680.
[15] MESCHEDER L, NOWOZIN S, GEIGER A. Adversarial variational Bayes:unifying variational autoencoders and generative adversarial networks[C]//Proceedings of the 34th International Conference on Machine Learning. New York:JMLR. org,2017:2391-2400.
[16] WANG R,ZHOU D,HE Y. ATM:adversarial-neural topic model[J]. Information Processing and Management,2019,56(6):No. 102098.
[17] ARJOVSKY M, CHINTALA S, BOTTOU L. Wasserstein generative adversarial networks[C]//Proceedings of the 34th International Conference on Machine Learning. New York:JMLR. org,2017:214-223.
[18] GULRAJANI I,AHEMD F,ARJOVSKY M,et al. Improved training of Wasserstein GANs[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook,NY:Curran Associates Inc.,2017:5767-5777.
[19] MIYATO T, KATAOKA T, KOYAMA M, et al. Spectral normalization for generative adversarial networks[EB/OL].[2020-02-16]. https://arxiv.org/pdf/1802.05957.pdf.
[20] BAHDANAU D,CHO K,BENGIO Y. Neural machine translation by jointly learning to align and translate[EB/OL].[2019-05-19]. https://arxiv.org/pdf/1409.0473.pdf.
[21] HU J,SHEN L,SUN G. Squeeze-and-excitation networks[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:7132-7141.
[22] VASWANI A,SHAZEER N,PARMAR N,et al. Attention is all you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook,NY:Curran Associates Inc.,2017:6000-6010.
[23] LIN Z,FENG M,DOS SANTOS C N,et al. A structured selfattentive sentence embedding[EB/OL].[2019-03-09]. https://arxiv.org/pdf/1703.03130.pdf.
[24] LI S,ZHAO Z,HU R,et al. Analogical reasoning on Chinese morphological and semantic relations[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics, 2018:138-143.
[25] KIM Y. Convolutional neural networks for sentence classification[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg,PA:Association for Computational Linguistics,2014:1746-1751.
[26] LEE J Y,DERNONCOURT F. Sequential short-text classification with recurrent and convolutional neural networks[C]//Proceedings of the 2016 Annual Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. Stroudsburg,PA:Association for Computational Linguistics,2016:515-520.

面向短文本情感分类的端到端对抗变分贝叶斯方法

End-to-end adversarial variational Bayes method for short text sentiment classification

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	黄于欣, 徐佳龙, 余正涛, 侯书楷, 周家啟. 基于生成提示的无监督文本情感转换方法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2667-2673.
[2]	曹铉, 罗天健. 运动想象脑电信号的跨被试动态多域对抗学习方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 645-653.
[3]	陆辉, 黄瑞章, 薛菁菁, 任丽娜, 林川. 深度动态文本聚类模型DDDC[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2370-2375.
[4]	王雨, 袁玉波, 过弋, 张嘉杰. 情感增强的对话文本情绪识别模型[J]. 《计算机应用》唯一官方网站, 2023, 43(3): 706-712.
[5]	钟磊, 周允升, 余敦辉, 崔海波. 基于亲和力与研究方向覆盖率的审稿人推荐算法[J]. 《计算机应用》唯一官方网站, 2023, 43(2): 430-436.
[6]	曹建乐, 李娜娜. 基于多层次注意力的语义增强情感分类模型[J]. 《计算机应用》唯一官方网站, 2023, 43(12): 3703-3710.
[7]	刘拥民, 杨钰津, 罗皓懿, 黄浩, 谢铁强. 基于双向循环生成对抗网络的无线传感网入侵检测方法[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 160-168.
[8]	杨世刚, 刘勇国. 融合语料库特征与图注意力网络的短文本分类方法[J]. 《计算机应用》唯一官方网站, 2022, 42(5): 1324-1329.
[9]	徐雪敏, 张秀国, 肖媛元, 曹志英. 基于优化的灰狼算法的大规模Web服务组合[J]. 《计算机应用》唯一官方网站, 2022, 42(10): 3162-3169.
[10]	杨丰瑞, 霍娜, 张许红, 韦巍. 基于注意力机制的主题扩展情感对话生成[J]. 计算机应用, 2021, 41(4): 1078-1083.
[11]	成科扬, 孟春运, 王文杉, 师文喜, 詹永照. 解耦表征学习研究进展[J]. 《计算机应用》唯一官方网站, 2021, 41(12): 3409-3418.
[12]	邓钰, 李晓瑜, 崔建, 刘齐. 用于短文本情感分类的多头注意力记忆网络[J]. 《计算机应用》唯一官方网站, 2021, 41(11): 3132-3138.
[13]	杨威亚, 余正涛, 高盛祥, 宋燃. 基于跨语言神经主题模型的汉越新闻话题发现方法[J]. 计算机应用, 2021, 41(10): 2879-2884.
[14]	朱思淼, 魏世伟, 魏思恒, 余敦辉. 基于弹幕情感分析和主题模型的视频推荐算法[J]. 计算机应用, 2021, 41(10): 2813-2819.
[15]	袁景凌, 丁远远, 潘东行, 李琳. 基于时序和上下文特征的中文隐式情感分类模型[J]. 计算机应用, 2021, 41(10): 2820-2828.