《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (5): 1489-1496.DOI: 10.11772/j.issn.1001-9081.2022081218

• 数据科学与技术 • 上一篇    下一篇

基于节点多关系的社团挖掘算法及其应用

周琳1,2,3, 肖玉芝1,2,3(), 刘鹏1,2,3, 秦有鹏1,2,3   

  1. 1.青海师范大学 计算机学院, 西宁 810016
    2.青海省藏文信息处理与机器翻译重点实验室(青海师范大学), 西宁 810008
    3.藏文信息处理教育部重点实验室(青海师范大学), 西宁 810008
  • 收稿日期:2022-07-19 修回日期:2022-09-15 接受日期:2022-09-23 发布日期:2023-05-08 出版日期:2023-05-10
  • 通讯作者: 肖玉芝
  • 作者简介:周琳(1997—),女,辽宁大连人,硕士研究生,主要研究方向:社团划分
    肖玉芝(1980—),女,青海西宁人,教授,博士,CCF会员,主要研究方向:复杂网络理论、数据挖掘、网络舆情分析 qh_xiaoyuzhi@139.com
    刘鹏(1995—),男,山西定襄人,硕士研究生,主要研究方向:大数据分析
    秦有鹏(1995—),男,甘肃古浪人,硕士研究生,主要研究方向:传播动力学。
  • 基金资助:
    国家自然科学基金资助项目(61763041);青海省重点研发计划项目(2020?GX?112)

Community mining algorithm based on multi-relationship of nodes and its application

Lin ZHOU1,2,3, Yuzhi XIAO1,2,3(), Peng LIU1,2,3, Youpeng QIN1,2,3   

  1. 1.Computer College,Qinghai Normal University,Xining Qinghai 810016,China
    2.Tibetan Information Processing and Machine Translation Key Laboratory of Qinghai Province (Qinghai Normal University),Xining Qinghai 810008,China
    3.Key Laboratory of Tibetan Information Processing,Ministry of Education (Qinghai Normal University),Xining Qinghai 810008,China
  • Received:2022-07-19 Revised:2022-09-15 Accepted:2022-09-23 Online:2023-05-08 Published:2023-05-10
  • Contact: Yuzhi XIAO
  • About author:ZHOU Lin, born in 1997, M. S. candidate. Her research interests include community division.
    XIAO Yuzhi, born in 1980, Ph. D., professor. Her research interests include complex network theory, data mining, internet public opinion analysis.
    LIU Peng, born in 1995, M. S. candidate. His research interests include big data analytics.
    QIN Youpeng, born in 1995, M. S. candidate. His research interests include spread dynamics.
  • Supported by:
    National Natural Science Foundation of China(61763041);Key Research and Development Program of Qinghai Province(2020-GX-112)

摘要:

为度量多关系节点相似性、挖掘具有多关系节点的社团结构,提出基于节点多关系的社团挖掘算法LSL-GN。首先基于节点相似性和节点可达性刻画具有多关系的节点相似性度量指标LHN-ISL;然后利用该指标重构目标网络的低密度模型,并结合GN(Girvan-Newman)算法完成社团划分。将LSL-GN算法与多个经典社团挖掘算法在模块度(Q)、标准化互信息(NMI)和调整兰德指数(ARI)上进行对比,结果显示LSL-GN算法在3个指标上均优于经典算法,说明它的社团划分质量相对较好。将LSL-GN应用于“用户-应用”的移动漫游网络模型中,划分出了以携程旅行、高德地图、滴滴出行等为基础应用的社团结构,而这些社团划分结果可为设计个性化套餐业务提供策略参考信息。

关键词: 社团挖掘, 社团划分, 社团检测, 复杂网络, 移动漫游网络, 节点相似性, 节点可达性

Abstract:

In order to measure the similarity of multi-relational nodes and mine the community structure with multi-relational nodes, a community mining algorithm based on multi-relationship of nodes, called LSL-GN, was proposed. Firstly, based on node similarity and node reachability, LHN-ISL, a similarity measurement index for multi-relational nodes, was described to reconstruct the low-density model of the target network, and the community division was completed by combining with GN (Girvan-Newman) algorithm. The LSL-GN algorithm was compared with several classical community mining algorithms on Modularity (Q value), Normalized Mutual Information (NMI) and Adjusted Rand Index (ARI). The results show that LSL-GN algorithm achieves the best results in terms of three indexes, indicating that the community division quality of LSL-GN is better. The “User-Application” mobile roaming network model was divided by LSL-GN algorithm into community structures based on basic applications such as Ctrip, Amap and Didi Travel. These results of community division can provide strategic reference information for designing personalized package services.

Key words: community mining, community division, community detection, complex network, mobile roaming network, node similarity, node reachability

中图分类号: