《计算机应用》唯一官方网站 ›› 2021, Vol. 41 ›› Issue (12): 3546-3550.DOI: 10.11772/j.issn.1001-9081.2021060981

• 第十八届中国机器学习会议(CCML 2021) • 上一篇    

基于异构图注意力网络的微博谣言监测模型

毕蓓1,2, 潘慧瑶1, 陈峰1, 隋京言4, 高扬3, 王耀君1()   

  1. 1.中国农业大学 信息与电气工程学院,北京 100083
    2.北京理工大学 计算机学院,北京 100081
    3.北京工业大学 经济与管理学院,北京 100124
    4.中国科学院 计算技术研究所,北京 100190
  • 收稿日期:2021-05-12 修回日期:2021-07-05 接受日期:2121-07-05 发布日期:2021-12-28 出版日期:2021-12-10
  • 通讯作者: 王耀君
  • 作者简介:毕蓓(2000—),女,山东菏泽人,硕士研究生,主要研究方向:图神经网络、联邦学习
    潘慧瑶(1999—),女,湖北荆州人,硕士研究生,主要研究方向:文本挖掘、知识图谱
    陈峰(1998—),男,浙江长兴人,硕士研究生,主要研究方向:虚拟现实、情感计算,智能设计
    隋京言(1982—),女,山东烟台人,博士研究生,主要研究方向:深度强化学习、组合优化、算法设计与分析
    高扬(1988—),女,山东烟台人,副教授,博士,主要研究方向:金融时间序列分析;
  • 基金资助:
    北京市自然科学基金青年项目(5214026);中国农业大学2115人才工程

Microblog rumor detection model based on heterogeneous graph attention network

Bei BI1,2, Huiyao PAN1, Feng CHEN1, Jingyan SUI4, Yang GAO3, Yaojun WANG1()   

  1. 1.College of Information and Electrical Engineering,China Agricultural University,Beijing 100083,China
    2.School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China
    3.College of Economics and Management,Beijing University of Technology,Beijing 100124,China
    4.Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
  • Received:2021-05-12 Revised:2021-07-05 Accepted:2121-07-05 Online:2021-12-28 Published:2021-12-10
  • Contact: Yaojun WANG
  • About author:BI Bei, born in 2000, M. S. candidate. Her research interests include graph neural network, federated learning.
    PAN Huiyao, born in 1999, M. S. candidate. Her research interests include text mining, knowledge graph.
    CHEN Feng, born in 1998, M. S. candidate. His research interests include virtual reality, affective computing, intelligent design.
    SUI Jingyan, born in 1982, Ph. D. candidate. Her research interests include deep reinforcement learning, combinatorial optimization, algorithm design and analysis.
    GAO Yang, born in 1988, Ph. D., associate professor. Her research interests include financial time series analysis.
  • Supported by:
    the Youth Program of Beijing Natural Science Foundation(5214026);the 2115 Talent Development Program of China Agricultural University

摘要:

社交媒体方便了人们的日常交流和信息传播,同时也是谣言滋生和传播的温床,因此如何在谣言传播早期自动监测极具现实意义,而现有的检测方法没有充分利用微博信息传播图的语义信息。为了解决这个问题,基于异构图注意力网络(HAN)构建了谣言监测模型MicroBlog-HAN。该模型采用含有节点级注意力和语义级注意力的分层注意力机制。首先,节点级注意力结合微博节点的邻居生成两组具有特定语义的节点嵌入;然后,语义级注意力融合不同语义,得到最终的节点嵌入,并输入到分类器中执行二分类任务;最后,给出输入微博是谣言还是非谣言的分类结果。在两个真实的微博谣言数据集上的实验结果表明,MicroBlog-HAN模型可以实现微博谣言较准确的识别,准确率超过87%。

关键词: 微博, 谣言监测, 异构图, 元路径, 异构图注意力网络

Abstract:

Social media highly facilitates people’s daily communication and disseminating information, but it is also a breeding ground for rumors. Therefore, how to automatically monitor rumor dissemination in the early stage is of great practical significance, but the existing detection methods fail to take full advantage of the semantic information of the microblog information propagation graph. To solve this problem, based on Heterogeneous graph Attention Network (HAN), a rumor monitoring model was built, namely MicroBlog-HAN. In the model, a hierarchical attention mechanism including node-level attention and semantic-level attention was adopted. First, the neighbors of microblog nodes were combined by the node-level attention to generate two groups of node embeddings with specific semantics. After that, different semantics were fused by the semantic-level attention to obtain the final node embeddings of microblog, which were then treated as the classifier’s input to perform the binary classification task. In the end, the classification result of whether the input microblog is rumor or not was given. Experimental results on two real-world microblog rumor datasets convincingly prove that MicroBlog-HAN model can accurately identify microblog rumors with an accuracy over 87%.

Key words: microblog, rumor detection, heterogeneous graph, meta-path, Heterogeneous graph Attention Network (HAN)

中图分类号: