Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1728-1737.DOI: 10.11772/j.issn.1001-9081.2025060754

• Artificial intelligence • Previous Articles    

Masked autoencoder enhanced dynamic heterogeneous graph representation learning model

Haoran YUAN1, Huan LIU1, Pengfei JIAO1,2, Zhidong ZHAO1(), Xianfei ZHANG3, Zunliang LIU4   

  1. 1.School of Cyberspace Security,Hangzhou Dianzi University,Hangzhou Zhejiang 310018,China
    2.Zhejiang Provincial Engineering Research Center for Data Security Governance (Hangzhou Dianzi University),Hangzhou Zhejiang 310018,China
    3.School of Electronics and Information,Hangzhou Dianzi University,Hangzhou Zhejiang 310018,China
    4.Hangzhou Meichuang Technology Company Limited,Hangzhou Zhejiang 310015,China
  • Received:2025-07-09 Revised:2026-01-12 Accepted:2026-01-26 Online:2026-01-29 Published:2026-06-10
  • Contact: Zhidong ZHAO
  • About author:YUAN Haoran, born in 2000, M. S. His research interests include graph neural network, data mining.
    LIU Huan, born in 1999, Ph. D. candidate. His research interests include complex network analysis, graph neural network, data mining.
    JIAO Pengfei, born in 1990, Ph. D., professor. His research interests include complex network analysis and its application.
    ZHANG Xianfei, born in 1980, M. S., associate professor. His research interests include autonomous unmanned system, medical signal analysis.
    LIU Zunliang, born in 1970, senior engineer. His research interests include data management, information security.
    First author contact:ZHAO Zhidong, born in 1976, Ph. D., professor. His research interests include explainable artificial intelligence, signal system.
  • Supported by:
    General Program of National Natural Science Foundation of China(62372146);“Pioneer” and “Leading Goose” Research and Development Program of Zhejiang Province(2024C01023)

掩码自编码器增强的动态异质图表示学习模型

袁浩然1, 刘欢1, 焦鹏飞1,2, 赵治栋1(), 张显飞3, 柳遵梁4   

  1. 1.杭州电子科技大学 网络空间安全学院,杭州 310018
    2.数据安全治理浙江省工程研究中心(杭州电子科技大学),杭州 310018
    3.杭州电子科技大学 电子信息学院,杭州 310018
    4.杭州美创科技股份有限公司,杭州 310015
  • 通讯作者: 赵治栋
  • 作者简介:袁浩然(2000—),男,江苏靖江人,硕士,主要研究方向:图神经网络、数据挖掘
    刘欢(1999—),男,河南周口人,博士研究生,CCF会员,主要研究方向:复杂网络分析、图神经网络、数据挖掘
    焦鹏飞(1990—),男,河南洛阳人,教授,博士,CCF会员,主要研究方向:复杂网络分析及其应用
    张显飞(1980—),男,安徽宿松人,副教授,硕士,主要研究方向:自主无人系统、医疗信号分析
    柳遵梁(1970—),男,浙江宁波人,高级工程师,主要研究方向:数据管理、信息安全。
    第一联系人:共同第一作者
    赵治栋(1976—),男,山东泰安人,教授,博士,主要研究方向:可解释人工智能、信号系统
  • 基金资助:
    国家自然科学基金面上项目(62372146);浙江省“尖兵”“领雁”研发攻关计划项目(2024C01023);浙江省“尖兵”“领雁”研发攻关计划项目(2024C01102);浙江省“尖兵”“领雁”研发攻关计划项目(2025C01023)

Abstract:

Real-world networks are often composed of multiple types of entities and interaction relationships, with topological structure and attributes evolving with time continuously. The heterogeneity and dynamics inherent in such networks can be fully described by Dynamic Heterogeneous Graph (DHG). To solve the problems of coarse spatio-temporal information fusion and heavy reliance of the supervised learning paradigm on manual labels in the existing DHG representation learning models, a Masked AutoEncoder (MAE) enhanced DHG representation learning model was proposed. Firstly, heterogeneous spatial information was fused through a multi-level attention structure, and temporal information was fused across snapshots. Then, representation information of nodes was enriched by leveraging the reconstruction loss of the masked autoencoder. Experimental results show that improvements of at least 1.26 to 3.99 percentage points in Area Under the receiver operating Characteristic curve (AUC) are achieved by the proposed model on link prediction tasks compared to baseline models on multiple real-world datasets. It can be seen that the proposed model provides an effective self-supervised framework for DHG representation learning, facilitating more precise capture of heterogeneous information and dynamic evolution laws in real networks.

Key words: representation learning, Dynamic Heterogeneous Graph (DHG), Masked AutoEncoder (MAE), self-supervised learning, link prediction

摘要:

现实世界网络通常由多类型实体和交互关系构成,且拓扑结构及属性随时间不断演化。这些网络所蕴含的异质性和动态性可以通过动态异质图(DHG)数据完整描述。为了解决现有DHG表示学习模型存在的时空信息融合较粗糙,及它们的监督学习范式强依赖于人工标签的问题,提出掩码自编码器(MAE)增强的DHG表示学习模型。首先,通过多层次注意力结构融合异质空间信息,并进行跨快照的时间信息聚合;其次,利用掩码自编码器的重建损失丰富节点的表示信息。实验结果表明,所提模型在多个真实世界数据集上的链路预测任务中相较于基线模型有至少1.26~3.99个百分点的受试者工作特征曲线下面积(AUC)提升。可见,所提模型为DHG表示学习提供了一个有效的自监督框架,有助于更精确地捕捉真实网络中的异质信息与动态演化规律。

关键词: 表征学习, 动态异质图, 掩码自编码器, 自监督学习, 链路预测

CLC Number: