计算机应用 ›› 2015, Vol. 35 ›› Issue (12): 3477-3480.DOI: 10.11772/j.issn.1001-9081.2015.12.3477

• 人工智能 • 上一篇    下一篇

基于潜在特征的重叠社团识别算法

孙辉霞1, 李跃新2   

  1. 1. 甘肃民族师范学院计算机科学系, 甘肃合作 747000;
    2. 湖北大学计算机与信息工程学院, 武汉 430062
  • 收稿日期:2015-05-28 修回日期:2015-08-05 出版日期:2015-12-10 发布日期:2015-12-10
  • 通讯作者: 孙辉霞(1971-),女(撒拉族),青海循化人,副教授,硕士,主要研究方向:算法分析、计算机图形学
  • 作者简介:李跃新(1958-),男,湖北武汉人,教授,博士,主要研究方向:人工智能、知识工程、智能控制系统、嵌入式系统。
  • 基金资助:
    国家自然科学基金资助项目(61170306);湖北省自然科学基金面上项目(2014CFB536);湖北省国际交流与合作项目(2012IHA0140);湖北省科技重大支撑项目(2014BAA089);湖北大学校自然科学基金资助项目(530-095183);甘肃民族师范学院院长基金资助项目(11-16)。

Overlapping community discovering algorithm based on latent features

SUN Huixia1, LI Yuexin2   

  1. 1. Department of Computer Science, Gansu Normal University for Nationalities, Hezuo Gansu 747000 China;
    2. School of Computer Science and Information Engineering, Wuhan University, Wuhan Hubei 430062, China
  • Received:2015-05-28 Revised:2015-08-05 Online:2015-12-10 Published:2015-12-10

摘要: 针对标签空间的指数增长这一问题,提出了一种基于潜在特征的重叠社团识别算法。首先,提出了一种包含重叠社团的网络产生式模型。根据该产生式模型,通过最大化目标网络的产生概率来推导网络中节点的潜在特征,并给出了优化目标函数。然后,通过将网络诱导为二部图,分析得出了潜在特征个数的下届,并据此对标签空间进行优化。实验表明,提出的重叠社团识别算法与BigClam算法相比较,在保持运行效率和查准率基本不变的前提下,可以明显提高检索结果的召回率。该算法可以有效地应对社团识别中标签空间的指数增长。

关键词: 社会网络, 重叠社团, 机器学习, 标注, 二部图

Abstract: In order to solve the problem of exponential increase of label space, an overlapping community discovery algorithm based on latent feature was proposed. Firstly, a generative model for network including overlapping communities was proposed. And based on the proposed generative model, an optimal object function was presented by maximizing the generative probability of the whole network, which was used to infer the latent features for each node in the network. Next, the network was induced into a bipartite graph, and the lower bound of feature number was analyzed, which was used to optimize the object function. The experiments show that, the proposed overlapping community discovering algorithm can improve the recall greatly while keeping the precision and execution efficiency unchanged, which indicates that the proposed algorithm is effective with the exponential increase of label space.

Key words: social network, overlapping community, machine learning, labelling, bipartite graph

中图分类号: