《计算机应用》唯一官方网站 ›› 2021, Vol. 41 ›› Issue (12): 3540-3545.DOI: 10.11772/j.issn.1001-9081.2021060963

• 第十八届中国机器学习会议(CCML 2021) • 上一篇    

基于用户传播网络与消息内容融合的谣言检测模型

薛海涛1, 王莉1(), 杨延杰1, 廉飚2   

  1. 1.太原理工大学 大数据学院,太原 030600
    2.北方自动控制技术研究所,太原 030006
  • 收稿日期:2021-05-12 修回日期:2021-06-25 接受日期:2021-07-04 发布日期:2021-12-28 出版日期:2021-12-10
  • 通讯作者: 王莉
  • 作者简介:薛海涛(1997—),男,山西介休人,硕士研究生,主要研究方向:自然语言处理、谣言检测
    杨延杰(1995—),男,山西原平人,硕士研究生,主要研究方向:自然语言处理、数据挖掘
    廉飚(1987—),男,山西太原人,硕士,主要研究方向:软件开发、数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61872260)

Rumor detection model based on user propagation network and message content

Haitao XUE1, Li WANG1(), Yanjie YANG1, Biao LIAN2   

  1. 1.College of Data Science,Taiyuan University of Technology,Taiyuan Shanxi 030600,China
    2.North Automatic Control Technology Institute,Taiyuan Shanxi 030006,China
  • Received:2021-05-12 Revised:2021-06-25 Accepted:2021-07-04 Online:2021-12-28 Published:2021-12-10
  • Contact: Li WANG
  • About author:XUE Haitao, born in 1997, M. S. candidate. His research interests include natural language processing, rumor detection.
    YANG Yanjie, born in 1995, M. S. candidate. His research interests include natural language processing, data mining.
    LIAN Biao, born in 1987, M. S. His research interests include software development, data mining.
  • Supported by:
    the National Natural Science Foundation of China(61872260)

摘要:

针对社交媒体平台上消息内容普遍很短、传播结构中存在大量空转发、用户角色与内容间的失配等条件约束,提出了一种基于传播网络中的用户属性信息和消息内容的谣言检测模型GMB_GMU。首先以用户属性为节点、传播链为边构建用户传播网络,并引入图注意力网络(GAT)得到用户属性的增强表示;同时,基于此用户传播网络,利用node2vec得到用户的结构表征,并使用互注意机制对其进行增强。另外,引入BERT建立源帖内容表征。最后,利用多模态门控单元(GMU)对用户属性表征、结构表征和源帖内容表征进行融合,从而得到消息的最终表征。实验结果表明,GMB_GMU模型在公开的Weibo数据上的准确率达到0.952,能够有效识别谣言事件,效果明显优于基于循环神经网络(RNN)和其他神经网络基准模型的传播算法。

关键词: 谣言检测, 用户属性, 图注意力网络, 多模态门控单元, 传播网络

Abstract:

Under the constrains of very short message content on social media platforms, a large number of empty forwards in the transmission structure, and the mismatch between user roles and contents, a rumor detection model based on user attribute information and message content in the propagation network, namely GMB_GMU, was proposed. Firstly, user propagation network was constructed with user attributes as nodes and propagation chains as edges, and Graph Attention neTwork (GAT) was introduced to obtain an enhanced representation of user attributes; meanwhile, based on this user propagation network, the structural representation of users was obtained by using node2vec, and it was enhanced by using mutual attention mechanism. In addition, BERT (Bidirectional Encoder Representations from Transformers) was introduced to establish the source post content representation of the source post. Finally, to obtain the final message representation, Gated Multimodal Unit (GMU) was used to integrate the user attribute representation, structural representation and source post content representation. Experimental results show that the GMB_GMU model achieves an accuracy of 0.952 on publicly available Weibo data and can effectively identify rumor events, which is significantly better than the propagation algorithms based on Recurrent Neural Network (RNN) and other neural network benchmark models.

Key words: rumor detection, user attribute, Graph Attention neTwork (GAT), Gated Multimodal Unit (GMU), propagation network

中图分类号: