《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (10): 3060-3068.DOI: 10.11772/j.issn.1001-9081.2021081484

• 数据科学与技术 • 上一篇    

基于高阶自包含协同过滤的有向网络链路预测

陈广福1,2, 王海波3, 连雁平1,2   

  1. 1.武夷学院 数学与计算机学院, 福建 武夷山 354300
    2.认知计算与智能信息处理福建省高校重点实验室(武夷学院), 福建 武夷山 354300
    3.湖南科技学院 信息工程学院, 湖南 永州 425199
  • 收稿日期:2021-08-19 修回日期:2021-12-03 接受日期:2021-12-08 发布日期:2022-01-07 出版日期:2022-10-10
  • 通讯作者: 陈广福
  • 作者简介:第一联系人:陈广福(1979—),男,江西上饶人,讲师,博士,主要研究方向:链路预测、网络表示cgf21st@163.com
    王海波(1980—),男,湖南郴州人,讲师,硕士,主要研究方向:链路预测
    连雁平(1981—),男,福建莆田人,副教授,硕士,主要研究方向:机器学习、大数据。
  • 基金资助:
    武夷学院引进人才科研启动基金资助项目(YJ202017)

Link prediction in directed network based on high-order self-included collaborative filtering

Guangfu CHEN1,2, Haibo WANG3, Yanping LIAN1,2   

  1. 1.School of Mathematics and Computer Science,Wuyi University,Wuyishan Fujian 354300,China
    2.Key Laboratory of Cognitive Computing and Intelligent Information Processing of Fujian Education Institutions (Wuyi University),Wuyishan Fujian 354300,China
    3.College of Information Engineering,Hunan University of Science and Engineering,Yongzhou Hunan 425199,China
  • Received:2021-08-19 Revised:2021-12-03 Accepted:2021-12-08 Online:2022-01-07 Published:2022-10-10
  • Contact: Guangfu CHEN
  • About author:CHEN Guangfu, born in 1979, Ph. D. , lecturer. His research interests include link prediction, network representation.
    CHEN Guangfu, born in 1979, Ph. D. , lecturer. His research interests include link prediction, network representation.
    WANG Haibo, born in 1980, M. S. , lecturer. His research interests include link prediction.
    LIAN Yanping, born in 1981, M. S. , associate professor. His research interests include machine learning, big data.
  • Supported by:
    Research Foundation for Introduction of Talents of Wuyi University(YJ202017)

摘要:

针对大部分现存有向网络链路预测方法仅关注有向局部结构及互惠链接信息而忽略有向全局结构的问题,提出高阶自包含协同过滤(HSCF)链路预测框架。首先,利用随机游走方法计算高阶相似度矩阵去保持有向网络的高阶路径信息;其次,将高阶相似度矩阵与协同过滤方法相融合构建HSCF框架;最后,把所提框架分别与有向共同邻居(DCN)、有向Adamic-Adar(DAA)、有向资源分配(DRA)和势能理论Bifan 4个典型有向结构相似度相融合,并由此提出HSCF-DCN、HSCF-DAA、HSCF-DRA和HSCF-Bifan 4个有向网络预测指标。在10个真实有向网络上的实验结果表明,与基准指标相比,HSCF-DCN、HSCF-DAA、HSCF-DRA和HSCF-Bifan的受试者工作特征(ROC)曲线下方面积(AUC)值分别平均提高了8.16%、8.85%、9.64%和10.33%,且F分数值分别平均提高了66.62%、68.32%、68.95%和76.18%。

关键词: 有向复杂网络, 链路预测, 协同过滤, 有向结构, 随机游走

Abstract:

Aiming at the problem that most existing directed network link prediction methods only focus on the directed local and reciprocal link information and ignore the directed global structure information, a High-order Self-included Collaborative Filtering (HSCF) link prediction framework was proposed. Firstly, random walk method was used to calculate the high-order similarity matrix to preserve the high-order path information of the directed network. Secondly, an HSCF framework was constructed by combining the high-order similarity matrix with collaborative filtering method. Finally, the proposed framework was integrated with four typical directed structure similarity indices including Directed Common Neighbor (DCN), Directed Adamic-Adar (DAA), Directed Resource Allocation (DRA) and potential theory (Bifan), and four directed network prediction indices HSCF-DCN, HSCF-DAA, HSCF-DRA and HSCF-Bifan were proposed on this basis. Compared with the baseline indices on ten real directed networks, the experimental results show that the AUC (Area Under Curve of Receiver Operating Characteristic (ROC)) values of HSCF-DCN, HSCF-DAA, HSCF-DRA and HSCF-Bifan are increased by an average of 8.16%, 8.85%, 9.64% and 10.33% respectively and the F-score values of them are increased by an average of 66.62%, 68.32%, 68.95% and 76.18% respectively.

Key words: directed complex network, link prediction, collaborative filtering, directed structure, random walk

中图分类号: