Journal of Computer Applications, 2022, Vol. 42, Issue (3): 663-670. DOI: 10.11772/j.issn.1001-9081.2021040790

• 2021 CCF Conference on Artificial Intelligence (CCFAI 2021) •

Network embedding method based on multi-granularity community information

Jun HU1,2, Zhengkang XU1,2, Li LIU1,2, Fujin ZHONG1,2

  1. Chongqing Key Laboratory of Computational Intelligence (Chongqing University of Posts and Telecommunications), Chongqing 400065, China
  2. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Received: 2021-05-17 Revised: 2021-06-11 Accepted: 2021-06-23 Online: 2021-11-09 Published: 2022-03-10
  • Contact: Jun HU
  • About author: XU Zhengkang, born in 1996 in Ma'anshan, Anhui, M. S. candidate. His research interests include machine learning and intelligent information processing.
    LIU Li, born in 1988 in Kaifeng, Henan, Ph. D., associate professor. His research interests include Web mining and social computing.
    ZHONG Fujin, born in 1979 in Chongqing, Ph. D., associate professor. His research interests include computer vision and knowledge discovery.
  • Supported by:
    National Key Research and Development Program of China (2017YFC0804002); National Natural Science Foundation of China (61876201)

Abstract:

Most existing network embedding methods preserve only the local structure of a network and ignore other latent information in it. To preserve community information and reflect the multi-granularity nature of community structure, a network Embedding method based on Multi-Granularity Community information (EMGC) was proposed. Firstly, the multi-granularity community structure of the network was obtained, and the node embeddings and community embeddings were initialized. Then, at each level of granularity, the community embeddings were updated according to the node embeddings from the previous level and the community structure at the current level, and the corresponding node embeddings were adjusted accordingly. Finally, the node embeddings obtained under the different community granularities were concatenated to give a network embedding that fuses community information across granularities. Experiments were carried out on four real network datasets. Compared with methods that do not consider community information (DeepWalk, node2vec) and methods that consider single-granularity community information (ComE, GEMSEC), EMGC generally achieved a better AUC on link prediction and a better F1 score on node classification. The experimental results show that EMGC can effectively improve the accuracy of subsequent link prediction and node classification.

Key words: network representation, network embedding, multi-granularity, community detection, complex network
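
The three steps named in the abstract (initialize, update level by level, concatenate) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the update rules, so the sketch assumes that a community embedding is the mean of its members' embeddings from the previous level and that each node is then moved a fraction `alpha` toward its community. The function name `emgc_sketch`, the parameters `dim`, `alpha` and `seed`, the random initialization, and the fine-to-coarse ordering of the partitions are all hypothetical. The multi-granularity community partitions are taken as input rather than detected.

```python
import random

def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def emgc_sketch(num_nodes, partitions, dim=4, alpha=0.5, seed=0):
    """Illustrative only. `partitions` holds one community-label list per
    granularity level (assumed given). Returns one concatenated vector per node."""
    rng = random.Random(seed)
    # Assumption: random vectors stand in for the initial node embedding.
    current = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(num_nodes)]
    per_level = []
    for labels in partitions:
        # Assumed update rule: a community embedding is the mean of its
        # members' embeddings taken from the previous level.
        members = {}
        for node, community in enumerate(labels):
            members.setdefault(community, []).append(current[node])
        centers = {c: mean_vector(vs) for c, vs in members.items()}
        # Assumed adjustment rule: move each node a fraction alpha
        # toward the embedding of its community at this level.
        current = [
            [(1 - alpha) * x + alpha * cx
             for x, cx in zip(current[node], centers[labels[node]])]
            for node in range(num_nodes)
        ]
        per_level.append(current)
    # Concatenate the per-level node embeddings into one vector per node.
    return [sum((level[node] for level in per_level), [])
            for node in range(num_nodes)]

# Six nodes, two granularity levels (three fine communities, two coarse ones).
fine = [0, 0, 1, 1, 2, 2]
coarse = [0, 0, 0, 0, 1, 1]
embedding = emgc_sketch(6, [fine, coarse])
print(len(embedding), len(embedding[0]))
```

With two levels and `dim=4`, each node ends up with an 8-dimensional vector, which shows how concatenation makes the final dimensionality grow with the number of granularity levels.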

CLC Number: