Journal of Computer Applications ›› 2018, Vol. 38 ›› Issue (6): 1698-1702.DOI: 10.11772/j.issn.1001-9081.2017102467

Previous Articles     Next Articles

Application of asymmetric information in link prediction

XIE Rui1,2, HAO Zhifeng3, LIU Bo1, XU Shengbing2   

  1. 1. School of Automation, Guangdong University of Technology, Guangzhou Guangdong 510006, China;
    2. School of Computers, Guangdong University of Technology, Guangzhou Guangdong 510006, China;
    3. School of Mathematics and Big Data, Foshan University, Foshan Guangdong 528000, China
  • Received:2017-10-18 Revised:2018-01-29 Online:2018-06-10 Published:2018-06-13
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61472090), the Science and Technology Project of Guangdong Province (2015B090906015), the Science and Technology Planning Project of Guangzhou (201707010492).

非对称信息在链接预测中的应用

谢锐1,2, 郝志峰3, 刘波1, 徐圣兵2   

  1. 1. 广东工业大学 自动化学院, 广州 510006;
    2. 广东工业大学 计算机学院, 广州 510006;
    3. 佛山科学技术学院 数学与大数据学院, 广东 佛山 528000
  • 通讯作者: 谢锐
  • 作者简介:谢锐(1977-),男,广东平远人,讲师,博士研究生,主要研究方向:机器学习、数据挖掘;郝志峰(1968-),男,江苏泰州人,教授,博士生导师,博士,主要研究方向:机器学习、数据挖掘;刘波(1978-),男,河南登封人,教授,博士生导师,博士,主要研究方向:机器学习;徐圣兵(1974-),男,湖南郴州人,讲师,博士研究生,主要研究方向:数据建模、迁移学习。
  • 基金资助:
    国家自然科学基金资助项目(61472090);广东省科技计划项目(2015B090906015);广州市科技计划项目(201707010492)。

Abstract: The prediction accuracy of link prediction based on node similarity is always reduced without considering the asymmetric information. In order to solve the problem, a novel method for node similarity measurement with asymmetric information was proposed. Firstly, the disadvantage of the similarity measure algorithm based on Common Neighbor (CN) was analyzed, which it only considered the number of CNs without considering the number of all neighbors of each node. Secondly, the similarity measure between nodes was defined as the ratio of the common nodes to all the neighbor nodes. Then, the symmetric similar information and the asymmetric similar information between nodes were combined, and the similarity between nodes was described in detail. Finally, the proposed method was applied to predict the link relationship in complex networks. The experimental results on the real datasets show that, compared with the previous common neighbor-based similarity measurement methods such as CN, Adamic Adar (AA) and Resource Allocation (RA), the proposed method can improve the accuracy of node similarity measurement and improve the accuracy of link relationship prediction in complex networks.

Key words: link prediction, similarity, similar coefficient, asymmetry, node similarity

摘要: 针对基于节点相似性的链接关系预测中因未考虑非对称信息导致预测准确度降低的问题,提出一种新的增加非对称信息的节点相似性度量方法。首先,分析了基于共同邻居(CN)的相似性度量算法的缺陷在于仅考虑CN的数量而未考虑各自节点的所有邻居的数量;然后,将节点之间的相似性度量定义为共同节点与所有邻居节点的比值,融合节点间对称相似信息和非对称相似信息,对节点间的相似程度进行深入细致的刻画;最后,将该方法应用到复杂网络中进行链接关系的预测。在真实数据集上的实验结果表明,与目前多种基于共邻的相似性度量方法——CN、AA、资源分配(RA)相比,所提方法提升了节点相似性度量的准确性,并且可以提高复杂网络中链接关系预测的准确度。

关键词: 链接预测, 相似性, 相似系数, 非对称, 节点相似

CLC Number: