Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (9): 3036-3044.DOI: 10.11772/j.issn.1001-9081.2024091304

• Frontier and comprehensive applications • Previous Articles    

Action recognition algorithm for ADHD patients using skeleton and 3D heatmap

Chao SHI1, Yuxin ZHOU1, Qian FU1, Wanyu TANG1, Ling HE1, Yuanyuan LI2()   

  1. 1.College of Biomedical Engineering,Sichuan University,Chengdu Sichuan 610065,China
    2.Mental Health Center of West China Hospital,Sichuan University,Chengdu Sichuan 610041,China
  • Received:2024-09-14 Revised:2025-01-15 Accepted:2025-01-24 Online:2025-03-24 Published:2025-09-10
  • Contact: Yuanyuan LI
  • About author:SHI Chao, born in 1997, M. S. candidate. His research interests include image processing.
    ZHOU Yuxin, born in 2003. Her research interests include artificial intelligence.
    FU Qian, born in 2001, M. S. candidate. Her research interests include deep learning.
    TANG Wanyu, born in 2001, M.S. candidate. His research interests include video image processing.
    HE Ling, born in 1981, Ph. D., associate professor. Her research interests include medical signal processing, medical image processing.
  • Supported by:
    Science and Technology Program of Sichuan Province(2023YFS0290)

基于骨架和3D热图的注意缺陷多动障碍患者动作识别算法

石超1, 周昱昕1, 扶倩1, 唐万宇1, 何凌1, 李元媛2()   

  1. 1.四川大学 生物医学工程学院,成都 610065
    2.四川大学 华西医院心理卫生中心,成都 610041
  • 通讯作者: 李元媛
  • 作者简介:石超(1997—),男,贵州铜仁人,硕士研究生,主要研究方向:图像处理
    周昱昕(2003—),女,广西桂林人,主要研究方向:人工智能
    扶倩(2001—),女,四川成都人,硕士研究生,主要研究方向:深度学习
    唐万宇(2001—),男,四川自贡人,硕士研究生,主要研究方向:视频图像处理
    何凌(1981—),女,四川成都人,副教授,博士,主要研究方向:医学信号处理、医学图像处理
  • 基金资助:
    四川省科技计划项目(2023YFS0290)

Abstract:

Attention Deficit Hyperactivity Disorder (ADHD) is a neurodevelopmental disorder common in childhood, characterized by inattention, hyperactivity, and impulsivity, often exhibiting specific motion patterns. Traditional action recognition algorithms have problems such as low recognition accuracy and slow response when handling these specific actions. To address these issues, an action recognition algorithm for ADHD patients using skeleton and 3D heatmap was proposed, and spatial relationships between joints were represented using Gaussian distribution precisely, which preserved spatio-temporal information effectively. To overcome the limitations of single-modal data, a multimodal integration method based on skeleton and 3D heatmap was introduced. At the same time, the output features of Short 3D-CNN (3D Convolutional Neural Network) and Adaptive Graph Convolutional Network (AGCN) were fused to fully exploit the advantages of both modalities, thereby improving action recognition performance. Experimental results on the ADHD patient dataset collected by Mental Health Center of West China Hospital, Sichuan University, show that the proposed algorithm achieves the Top-1 recognition accuracy of 0.860 4 and the Top-5 recognition accuracy of 0.987 3 for eight different types of actions. Additionally, an automatic ADHD classification algorithm based on action types was proposed, which classified ADHD into head and facial action type, trunk action type, and limb action type, achieving the recognition accuracy of 75% and the response time of 5 seconds. Compared with two-stream AGCN (2s-AGCN) and PoseConv3D, the proposed algorithm demonstrates higher recognition accuracy in complex action scenarios, providing a new technical approach for personalized ADHD intervention.

Key words: Attention Deficit Hyperactivity Disorder (ADHD), action recognition, skeleton data, Graph Convolutional Network (GCN), 3D Convolutional Neural Network (CNN)

摘要:

注意缺陷多动障碍(ADHD)是一种常见于儿童期的神经发育障碍,以注意力不集中、多动和冲动为主要特征,常表现出特定的动作模式。传统的动作识别算法在处理这些特定动作时存在识别准确率低和响应慢等问题。为解决这些问题,提出基于骨架和3D热图的注意缺陷多动障碍患者动作识别算法,并通过高斯分布精确地表示关节点间的空间关系,以有效地保留时空信息。针对单一模态数据的限制,引入基于骨架和3D热图的多模态集成方法。同时,通过融合Short 3D-CNN(3D Convolutional Neural Network)和自适应图卷积网络(AGCN)的输出特征,充分利用两种模态数据的优势,从而提升动作识别性能。在四川大学华西医院心理卫生中心采集的ADHD患者数据集上的实验结果表明,对于8种不同类型的动作,所提算法的Top-1识别准确率为0.860 4,Top-5识别准确率为0.987 3。此外,提出基于动作类型的ADHD自动分型算法,该算法将ADHD分型为头面部体动型、躯干体动型和四肢体动型,它的识别准确率为75%,响应时间为5 s。与2s-AGCN(two-stream AGCN)和PoseConv3D相比,所提算法在复杂动作场景下具有更高的识别精度,为ADHD的个性化干预提供了新的技术手段。

关键词: 注意缺陷多动障碍, 动作识别, 骨架数据, 图卷积网络, 3D卷积神经网络

CLC Number: