Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (3): 736-743.DOI: 10.11772/j.issn.1001-9081.2022020207

• Artificial intelligence • Previous Articles    

Pedestrian trajectory prediction based on multi-head soft attention graph convolutional network

Tao PENG1,2,3, Yalong KANG2,3, Feng YU1,3(), Zili ZHANG2,3, Junping LIU2,3, Xinrong HU2,3, Ruhan HE1,3, Li LI1,2   

  1. 1.Hubei Provincial Engineering Research Center for Intelligent Textile and Fashion (Wuhan Textile University),Wuhan Hubei 430200,China
    2.Engineering Research Center of Hubei Province for Clothing Information (Wuhan Textile University),Wuhan Hubei 430200,China
    3.School of Computer Science and Artificial Intelligence,Wuhan Textile University,Wuhan Hubei 430200,China
  • Received:2022-02-24 Revised:2022-05-17 Accepted:2022-05-19 Online:2022-08-16 Published:2023-03-10
  • Contact: Feng YU
  • About author:PENG Tao, born in 1981, Ph. D., professor. His research interests include data reduction, pattern recognition, network security.
    KANG Yalong, born in 1997, M. S. candidate. His research interests include computer vision.
    ZHANG Zili, born in 1981, Ph. D., lecturer. His research interests include image processing, computer vision.
    LIU Junping, born in 1980, Ph. D., associate professor. His research interests include computer vision.
    HU Xinrong, born in 1973, Ph. D., professor. Her research interests include graphics and image processing.
    HE Ruhan, born in 1974, Ph. D., professor. His research interests include machine learning, artificial intelligence.
    LI Li, born in 1982, Ph. D., associate professor. Her research interests include machine vision, optical nondestructive testing.
  • Supported by:
    National Natural Science Foundation of China(61901308);Youth Project of Education Department of Hubei Province(Q201316);Key Project of Scientific Research Plan of Education Department of Hubei Province(D20191708)

基于多头软注意力图卷积网络的行人轨迹预测

彭涛1,2,3, 康亚龙2,3, 余锋1,3(), 张自力2,3, 刘军平2,3, 胡新荣2,3, 何儒汉1,3, 李丽1,2   

  1. 1.纺织服装智能化湖北省工程研究中心(武汉纺织大学), 武汉 430200
    2.湖北省服装信息化工程技术研究中心(武汉纺织大学), 武汉 430200
    3.武汉纺织大学 计算机与人工智能学院, 武汉 430200
  • 通讯作者: 余锋
  • 作者简介:彭涛(1981—),男,湖北武汉人,教授,博士,CCF会员,主要研究方向:数据简化、模式识别、网络安全
    康亚龙(1997—),男,湖北孝感人,硕士研究生,CCF会员,主要研究方向:计算机视觉
    余锋(1989—),男,湖北武汉人,讲师,博士,CCF会员,主要研究方向:医学图像处理、光学成像
    张自力(1981—),男,湖北武汉人,讲师,博士,CCF会员,主要研究方向:图像处理、计算机视觉
    刘军平(1980—),男,湖北武汉人,副教授,博士,CCF会员,主要研究方向:计算机视觉
    胡新荣(1973—),女,湖北武汉人,教授,博士,CCF会员,主要研究方向:图形图像处理
    何儒汉(1974—),男,湖北武汉人,教授,博士,CCF会员,主要研究方向:机器学习、人工智能
    李丽(1982—),女,湖北武汉人,副教授,博士,CCF会员,主要研究方向:机器视觉、光学无损检测。
  • 基金资助:
    国家自然科学基金资助项目(61901308);湖北省教育厅青年项目(Q201316);湖北省教育厅科研计划重点项目(D20191708)

Abstract:

The complexity of pedestrian interaction is a challenge for pedestrian trajectory prediction, and the existing algorithms are difficult to capture meaningful interaction information between pedestrians, which cannot intuitively model the interaction between pedestrians. To address this problem, a multi-head soft attention graph convolutional network was proposed. Firstly, a Multi-head Soft ATTention (MS ATT) combined with involution network was used to extract sparse spatial adjacency matrix and sparse temporal adjacency matrix from spatial and temporal graph inputs respectively to generate sparse spatial directed graph and sparse temporal directed graph. Then, a Graph Convolutional Network (GCN) was used to learn interaction and motion trend features from sparse spatial and sparse temporal directed graphs. Finally, the learned trajectory features were input into a Temporal Convolutional Network (TCN) to predict double Gaussian distribution parameters, thereby generating the predicted pedestrian trajectories. Experiments on Eidgenossische Technische Hochschule (ETH) and University of CYprus (UCY) datasets show that, compared with Space-time sOcial relationship pooling pedestrian trajectory Prediction Model (SOPM), the proposed algorithm reduces the Average Displacement Error (ADE) by 2.78%, and compared to Sparse Graph Convolution Network (SGCN), the proposed algorithm reduces the Final Displacement Error (FDE) by 16.92%.

Key words: Multi-head Soft ATTention (MS ATT), channel attention, spatial attention, involution network, Graph Convolutional Network (GCN), pedestrian trajectory prediction

摘要:

行人间交互作用的复杂性给行人轨迹预测带来了挑战,且现有算法难以捕获行人间有意义的交互信息,不能直观地建模行人间的交互作用。针对以上问题,提出多头软注意力图卷积网络。首先利用多头软注意力(MS ATT)结合内卷网络Involution分别从空间图和时间图输入中提取稀疏空间和稀疏时间邻接矩阵,生成稀疏空间和稀疏时间有向图;然后,利用图卷积网络(GCN)从稀疏空间和稀疏时间有向图中学习交互作用与运动趋势特征;最后,将学习到的轨迹特征输入时间卷积网络(TCN)以预测双高斯分布参数,生成行人预测轨迹。在ETH和UCY数据集上的实验结果表明:相较于空时社交关系池化行人轨迹预测模型(SOPM),所提算法的平均位移误差(ADE)降低了2.78%;相较于稀疏图卷积网络(SGCN),所提算法的最终位移误差(FDE)降低了16.92%。

关键词: 多头软注意力, 通道注意力, 空间注意力, 内卷, 图卷积网络, 行人轨迹预测

CLC Number: