计算机应用 ›› 2012, Vol. 32 ›› Issue (08): 2154-2158.DOI: 10.3724/SP.J.1087.2012.02154

• 先进计算 • 上一篇    下一篇

基于“次中心”的社区结构探寻算法

水超1,李慧2   

  1. 1. 国防科学技术大学 信息系统与管理学院,长沙 410073
    2. 国防科学技术大学 信息中心,长沙 410073
  • 收稿日期:2011-12-09 修回日期:2012-02-12 发布日期:2012-08-28 出版日期:2012-08-01
  • 通讯作者: 水超

Community structure identification algorithm based on subcenter

SHUI Chao1,LI Hui2   

  1. 1. College of Information System and Management, National University of Defense Technology, Changsha Hunan 410073, China
    2. Information Center, National University of Defense Technology, Changsha Hunan 410073, China
  • Received:2011-12-09 Revised:2012-02-12 Online:2012-08-28 Published:2012-08-01
  • Contact: SHUI Chao

摘要: 当前社区结构探测算法在寻求社区结构划分正确性的同时,算法效率较低。为此,提出一种在算法正确性和算法效率两个方面能取得较好均衡的社区结构探寻算法CoreScan。该算法寻找节点集合中一类称之为“次中心”的特殊节点,再将其作为聚类中心,然后通过D模块度来发现社区结构。理论分析表明,该算法能正确识别Fortunato提出的一类特殊社区结构,且算法效率可达O(n*kmax),其中n是节点数量,kmax是“次中心”最大数量。最后通过多项实验证明,CoreScan算法能够在效率和正确性上取得较好的均衡,适合于在大规模节点集合中进行快速社区结构探寻。

关键词: 社区结构, 社区探测, Q模块度, D模块度

Abstract: Some community structure identification algorithms in complex network research achieve high quality for identifying community correctly but low efficiency, but some algorithms were on the contrary. It is a challenge for community structure identifying algorithms to balance efficiency and correctness for category meanwhile. In this paper, a new algorithm named CoreScan was proposed. Different from existing algorithm, CoreScan found some special node named subcenter in network and using a preprocessing phase to divide nodes into several communities according to subcenter, then cluster nodes by evaluating the D modularity of community. The definition of subcenter was given and some certifications also were shown to guarantee the correction of CoreScan algorithm. At last, the algorithm was tested on two artificial networks and two real-world graphs. The experimental results show that the algorithm achieves the goal of linear efficiency, and the correction of identifying community is no less than existing algorithms. This algorithm is suitable to large-scale network structure investigation.

Key words: community structure, community detection, Q modularity, D modularity

中图分类号: