《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (8): 2319-2324.DOI: 10.11772/j.issn.1001-9081.2022091334

• 第十九届CCF中国信息系统及应用大会 •    

面向超图的极大团搜索算法

徐兰天1, 李荣华1, 戴永恒2, 王国仁1   

  1. 1.北京理工大学 计算机学院,北京 100081
    2.电科云(北京)科技有限公司,北京 100041
  • 收稿日期:2022-09-06 修回日期:2022-09-23 接受日期:2022-10-08 发布日期:2022-11-02 出版日期:2023-08-10
  • 通讯作者: 李荣华
  • 作者简介:徐兰天(1997—),男,河北唐山人,硕士研究生,主要研究方向:图数据挖掘
    戴永恒(1983—),男,北京人,工程师,博士,主要研究方向:领域建模、专家知识结构化、融合的机器学习
    王国仁(1966—),男,北京人,教授,博士,CCF会员,主要研究方向:数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(62072034);国家重点研发计划课题(2021YFB3301301)

Maximal clique searching algorithm for hypergraphs

Lantian XU1, Ronghua LI1, Yongheng DAI2, Guoren WANG1   

  1. 1.School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China
    2.Diankeyun (Beijing) Technology Company Limited,Beijing 100041,China
  • Received:2022-09-06 Revised:2022-09-23 Accepted:2022-10-08 Online:2022-11-02 Published:2023-08-10
  • Contact: Ronghua LI
  • About author:XU Lantian, born in 1997, M. S. candidate. His research interests include graph data mining.
    DAI Yongheng, born in 1983, Ph. D., engineer. His research interests include domain modeling, expert knowledge structuring, integrated machine learning.
    WANG Guoren, born in 1966, Ph. D., professor. His research interests include data mining.
  • Supported by:
    National Natural Science Foundation of China(62072034);National Key Research and Development Program of China(2021YFB3301301)

摘要:

现实世界中的实体关系大多不能用简单的二元关系来表示,而超图能很好地表示实体间的多元关系。因此,提出超图团和极大团的定义,并给出了搜索超图极大团的精确算法和近似算法。首先,分析了现有的普通图上的极大团搜索算法无法直接应用到超图上的原因。然后,基于超图的特性和极大团的定义,提出了一种新颖的保存超点间邻接关系的数据结构,并提出了一种超图上的精确极大团搜索算法。由于精确算法的速度较慢,因此结合支撑点(pivot)的剪枝思想,削减递归层数,提出了一种超图上的近似极大团搜索算法。在多个真实超图数据集上的实验结果显示,所提近似算法在找到大多数极大团的前提下,提高了搜索速度,当在3-uniform超图上,测试超图团的点数为22时,加速比达到了1 000以上。

关键词: 超图, 极大团, 集合枚举, 近似算法, 支撑点

Abstract:

Most of entity relationships in the real world cannot be represented by simple binary relations, and hypergraph can represent the n-ary relations among entities well. Therefore, definitions of hypergraph clique and maximal clique were proposed, and the exact algorithm and approximation algorithm for searching hypergraph maximal clique were given. First, the reason why the existing maximal clique searching algorithms on ordinary graphs cannot be applied to hypergraphs directly was analyzed. Then, based on the characteristics of hypergraph and the definition of maximal clique, a novel data structure for preserving the adjacency relations among hyperpoints was proposed, and an accurate maximal clique searching algorithm on hypergraph was proposed. As the running of the exact algorithm is slow, the pruning idea of pivots was combined with, the number of recursive layers was reduced, and an approximation maximal clique searching algorithm on hypergraph was proposed. Experimental results on multiple real hypergraph datasets show that under the premise finding most maximal cliques, the proposed approximation algorithm improves the search speed. When the number of test hypergraph cliques on 3-uniform hypergraph is 22, the acceleration ratio reaches over 1 000.

Key words: hypergraph, maximal clique, set enumeration, approximation algorithm, pivot

中图分类号: