《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (4): 1109-1114.DOI: 10.11772/j.issn.1001-9081.2022040562

• 数据科学与技术 • 上一篇    下一篇

基于改进的局部结构熵复杂网络重要节点挖掘

李鹏1,2, 王世林1,2(), 陈光武1,2, 闫光辉3   

  1. 1.兰州交通大学 自动化与电气工程学院, 兰州 730070
    2.甘肃省高原交通信息工程及控制重点实验室(兰州交通大学), 兰州 730070
    3.兰州交通大学 电子与信息工程学院, 兰州 730070
  • 收稿日期:2022-04-24 修回日期:2022-08-20 接受日期:2022-08-30 发布日期:2022-10-14 出版日期:2023-04-10
  • 通讯作者: 王世林
  • 作者简介:李鹏(1985—),男,青海海东人,博士研究生,CCF会员,主要研究方向:复杂网络、智能交通、信息系统工程;
    陈光武(1976—),男,新疆阿克苏人,教授,博士,CCF会员,主要研究方向:交通信息工程及控制、智能感知、自主控制;
    闫光辉(1970—),男,河南商丘人,教授,博士,CCF会员,主要研究方向:数据挖掘、社交网络分析。
  • 基金资助:
    国家自然科学基金资助项目(62062049);甘肃省科技重大专项(21ZD4WA018);甘肃省科技引导计划项目(2020?61?14);甘肃省自然科学基金资助项目(20JR5RA390)

Key node mining in complex network based on improved local structural entropy

Peng LI1,2, Shilin WANG1,2(), Guangwu CHEN1,2, Guanghui YAN3   

  1. 1.School of Automation and Electrical Engineering,Lanzhou Jiaotong University,Lanzhou Gansu 730070,China
    2.Key Laboratory of Plateau Traffic Information Engineering and Control of Gansu Province (Lanzhou Jiaotong University),Lanzhou Gansu 730070,China
    3.School of Electronic and Information Engineering,Lanzhou Jiaotong University,Lanzhou Gansu 730070,China
  • Received:2022-04-24 Revised:2022-08-20 Accepted:2022-08-30 Online:2022-10-14 Published:2023-04-10
  • Contact: Shilin WANG
  • About author:LI Peng, born in 1985, Ph. D. candidate. His research interests include complex networks, intelligent transportation, information system engineering.
    CHEN Guangwu, born in 1976, Ph. D., professor. His research interests include traffic information engineering and control, intelligent perception, autonomous control.
    YAN Guanghui, born in 1970, Ph. D., professor. His research interests include data mining, social network analysis.
  • Supported by:
    National Natural Science Foundation of China(62062049);Gansu Science and Technology Major Project(21ZD4WA018);Gansu Province Science and Technology Guidance Plan(2020-61-14);Natural Science Foundation of Gansu Province(20JR5RA390)

摘要:

识别复杂网络中的关键节点对优化网络结构以及信息的有效传播起着至关重要的作用。局部结构熵(LE)利用局部网络对整个网络的影响代替节点对整个网络的影响以识别重要节点,然而LE未考虑高聚集性网络和节点与邻居节点形成环的情况,存在一定的局限性。针对以上不足,首先,提出了改进LE的节点重要性评价方法PLE(Penalized Local structural Entropy),即在LE的基础上引入集聚系数(CC)作为惩罚项,从而适当惩罚网络中的高聚集性节点;其次,由于PLE的惩罚项对三元闭包结构上的节点惩罚力度过大,又提出了PLE的改进方法PLEA(Penalized Local structural Entropy Advancement),即在惩罚项前引入一个控制系数,以控制惩罚力度。对5个不同规模的真实网络进行选择性攻击实验,实验结果表明,在美国西部各州电网和美国航空网两个网络中,与LE方法相比,PLEA的识别准确率分别提升了26.3%和3.2%;与K-Shell(KS)方法相比,PLEA的识别准确率分别提升了380%和5.43%;与DCL(Degree and Clustering coefficient and Location)方法相比,PLEA的识别准确率分别提升了14.4%和24%。同时,PLEA识别的重要节点对网络造成的破坏更大,验证了引入CC作为惩罚项的合理性,以及PLEA的有效性和优越性。PLEA综合考虑了节点的邻居个数和节点的局部网络结构,计算简单,对于刻画大规模网络的可靠性与抗毁性具有十分重要的意义。

关键词: 复杂网络, 重要节点, 局部结构熵, 惩罚项, 集聚系数

Abstract:

The identification of key nodes in complex network plays an important role in the optimization of network structure and effective propagation of information. Local structural Entropy (LE) can be used to identify key nodes by using the influence of the local network on the whole network instead of the influence of nodes on the whole network. However, the cases of the highly aggregative network and nodes forming a loop with neighbor nodes are not considered in LE, which leads to some limitations. To address these limitations, firstly, an improved LE based node importance evaluation method, namely PLE (Penalized Local structural Entropy), was proposed, in which based on the LE, the Clustering Coefficient (CC) was introduced as a penalty term to penalize the highly aggregative nodes in the network appropriately. Secondly, due to the fact that the penalty of PLE penalizing the nodes in triadic closure structure is too much, an improved method of PLE, namely PLEA (Penalized Local structural Entropy Advancement) was proposed, in which control coefficient was introduced in front of the penalty term to control the penalty strength. Selective attack experiments on five real networks with different sizes were conducted. Experimental results show that in the western US states grid and the US Airlines, PLEA has the identification accuracy improved by 26.3% and 3.2% compared with LE respectively, by 380% and 5.43% compared with K-Shell (KS) method respectively, and by 14.4% and 24% compared with DCL (Degree and Clustering coefficient and Location) method respectively. The key nodes identified by PLEA can cause more damage to the network, verifying the rationality of introducing the CC as a penalty term, and the effectiveness and superiority of PLEA. The integration of the number of neighbors and the local network structure of nodes with the simplicity of computation makes it more effective in describing the reliability and invulnerability of large-scale networks.

Key words: complex network, key node, Local structural Entropy (LE), penalty term, Clustering Coefficient (CC)

中图分类号: