基于骨架图与混合注意力的视频行人异常检测方法（BigData2023_P00320)

• •

基于骨架图与混合注意力的视频行人异常检测方法（BigData2023_P00320)

刘禹含¹,吉根林²,张红苹¹

1. 南京师范大学
2. 南京师范大学计算机科学与技术学院，南京210023

收稿日期:2023-08-29 修回日期:2023-09-11 发布日期:2023-12-18
通讯作者: 吉根林
基金资助:
国家自然科学基金

Video Pedestrian Anomaly Detection Method Based on Skeleton Graph and Mixed Attention

Received:2023-08-29 Revised:2023-09-11 Online:2023-12-18
Contact: JI Genlin
Supported by:
National Natural Science Foundation of China

摘要/Abstract

摘要： 人体骨架曾被广泛应用于行为识别等领域，其作为一种拓扑结构的描述方式对光照变化以及背景噪声具有良好的鲁棒性，因此非常适合研究视频行人异常检测。近些年来许多研究通过时空图卷积网络构建模型进行检测，但这类方法中描述人体骨架连接强弱的方式一般只考虑到直接相连的节点，所关注的运动区域较小且忽略了局部特征，要做到准确检测行人异常事件依然存在很大的困难。因此提出了一种基于骨架图与混合注意力的视频行人异常检测算法 PAD-SGMA，该方法首先扩展骨架点之间的关联，将根节点与未直接相连的节点进行连接，并且对人体骨架图进行划分获取人体骨架局部特征，在图卷积模块中利用静态全局骨架、局部区域骨架和基于注意的邻接矩阵来捕获层次表示。其次，提出新的时空通道混合注意图卷积网络，增加混合注意力模块，关注空间和通道关系，帮助模型增强区分特征且对每个关节进行不同程度的关注。为了验证所提出的模型，本文在大规模的公开标准数据集（ShanghaiTech Campus 数据集）上进行实验，结果表明 PAD-SGMA 与其他方法相比准确率更高。

关键词: 视频异常检测, 深度学习, 人体骨架, 图卷积网络, 注意力

Abstract: Human skeleton has been widely used in the field of behavior recognition, and as a topological structure description method, it has good robustness to light changes and background noise, so it is very suitable for the study of video pedestrian anomaly detection. In recent years, spatiotemporal graph convolutional networks have been used to construct models for detection. However, most of the methods used to describe the strength of human skeleton connection only consider directly connected nodes, focus on small moving areas, and ignore local features. It is still very difficult to accurately detect pedestrian abnormal events. Therefore, a video pedestrian anomaly detection algorithm, PAD-SGMA, based on skeleton graph and mixed attention, is proposed. This method first expands the association between skeleton points, connects the root node with the node that is not directly connected, and divides the human skeleton graph to obtain the local features of the human skeleton. In the graph convolution module, static global skeleton, local region skeleton and attention-based adjacency matrix are used to capture the hierarchical representation. Secondly, a new convolutional network of spatiotemporal channels mixed attention graphs is proposed to increase the attention space and channel relations of the mixed attention module, which helps the model enhance the distinguishing features and give different levels of attention to each joint. In order to verify the proposed model, experiments are conducted on a large-scale open standard dataset (ShanghaiTech Campus dataset), and the results show that PAD-SGMA is more accurate than other methods.

Key words: video anomaly detection, deep learning, human skeleton, graph convolutional network, attention

中图分类号:

TP391.4

刘禹含吉根林张红苹. 基于骨架图与混合注意力的视频行人异常检测方法（BigData2023_P00320)[J]. 计算机应用.

[1]	翟社平, 杨晴, 黄妍, 杨锐. 融合有向关系与关系路径的层次注意力的知识图谱补全[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1148-1156.
[2]	王利琴, 耿智雷, 李英双, 董永峰, 边萌. 基于路径和增强三元组文本的开放世界知识推理模型[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1177-1183.
[3]	胡婕, 郑启扬, 孙军, 张龑. 基于多标签关系图和局部动态重构学习的多标签分类模型[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1104-1112.
[4]	徐春, 吉双焱, 马欢, 孙恩威, 王萌萌, 苏明钰. 基于知识图谱和对话结构的问诊推荐方法[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1157-1168.
[5]	周阳, 李辉. 基于语义和细节特征双促进的遥感影像建筑物提取网络[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1310-1316.
[6]	张李伟, 梁泉, 胡禹涛, 朱乔乐. 基于分组卷积的通道重洗注意力机制[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1069-1076.
[7]	姜坤元, 李小霞, 王利, 曹耀丹, 张晓强, 丁楠, 周颖玥. 引入解耦残差自注意力的边界交叉监督语义分割网络[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1120-1129.
[8]	党伟超, 宋楚君, 高改梅, 刘春霞. 基于级联残差图卷积网络的多行为推荐[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1223-1231.
[9]	郭诗月, 党建武, 王阳萍, 雍玖. 结合注意力机制和多尺度特征融合的三维手部姿态估计[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1293-1299.
[10]	潘理虎, 彭守信, 张睿, 薛之洋, 毛旭珍. 面向运动前景区域的视频异常检测[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1300-1309.
[11]	王一丁, 王泽浩, 李耀利, 蔡少青, 袁媛. 多尺度2D-Adaboost的中药材粉末显微图像识别算法[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1325-1332.
[12]	李嘉欣, 莫思特. 基于MiniRBT-LSTM-GAT与标签平滑的台区电力工单分类[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1356-1362.
[13]	薛振华, 李强, 黄超. 视觉基础模型驱动的像素级图像异常检测方法[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 823-831.
[14]	耿海军, 董赟, 胡治国, 池浩田, 杨静, 尹霞. 基于Attention-1DCNN-CE的加密流量分类方法[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 872-882.
[15]	佘本杰, 苏树智, 朱彦敏, 华健, 王超. 基于非全局依赖积分回归的轻量姿态估计网络[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 972-977.