计算机应用 ›› 2013, Vol. 33 ›› Issue (11): 3134-3137.

• 人工智能 • 上一篇    下一篇

基于树状朴素贝叶斯模型的社会网络关系预测

伍杰华1,2   

  1. 1. 华南理工大学 信息科学与技术学院,广州 510641
    2. 广东工贸职业技术学院 计算机工程系,广州 510510
  • 收稿日期:2013-05-02 修回日期:2013-06-14 出版日期:2013-11-01 发布日期:2013-12-04
  • 通讯作者: 伍杰华
  • 作者简介:伍杰华(1982-),男,广东广州人,讲师,高级工程师,博士,主要研究方向:机器学习、数据挖掘、社会网络分析。
  • 基金资助:
    国家自然科学基金资助项目;广东省教育部产学研结合项目

TAN Model For Ties Prediction In Social Networks

WU Jiehua1,2   

  1. 1. College of Information Science and Technology, South China University of Technology University, Guangzhou Guangdong 510641, China
    2. Department of Computer Science and Engineering, Guangdong College of Industry and Commerce, Guangzhou Guangdong 510510, China;
  • Received:2013-05-02 Revised:2013-06-14 Online:2013-12-04 Published:2013-11-01
  • Contact: WU Jiehua

摘要: 在社会网络关系预测研究领域,把基于拓扑结构信息的共邻节点属性作为相似性度量的预测模型应用比较广泛,但是该类算法具有较强的假设独立性,不能完全反映社会网络的“链接”结构。引入树状朴素贝叶斯(TAN)分类模型,采用信息熵度量节点对的角色,赋予共邻节点集合差异化的贡献权重进行社会关系预测,同时把模型推广到CN,AA和RA 等3种基于相似度的链接预测算法中。对5个真实社会网络采用AUC和ROC曲线进行实验评价后证明,该模型能够在深入挖掘共邻节点对贡献及解决共邻节点角色独立性的基础上提高预测精确度,同时为该类模型的研究提供一种新的方案。

关键词: 社会网络分析, 关系预测, 链接预测, 共邻节点, 贝叶斯模型

Abstract: In the research field of social ties prediction, taking common neighbors property as the similarity-based topological measure to carry the task of prediction has been widely used and better results have been achieved, which nevertheless has strong assuming independence and can not reflect the "link" and related network structure. This paper proposed a new measure of link prediction by introducing a Tree Augmented Nave Bayesians (TAN) classification model, which used information entropy measure to define the role of the node pair and gave differentiated neighbors set contribution to the task of social ties prediction, and then it was extended to Common Neighbor (CN), Adamic-Adar (AA) and Resource Allocation (RA) similarity-based prediction algorithms. The experimental evaluation by Area Under ROC Curve (AUC) and Receiver Operating Characteristic (ROC) curve on five real social networks prove that the proposed model can mine the latent common neighbors contribution and alleviate the independence hypothesis, which leads to enhance the accuracy of link prediction.

Key words: social network analysis, tie prediction, link prediction, Common Neighbor (CN), Bayesian model

中图分类号: