基于冗余特征抑制的轻量级人体姿态估计网络

doi:10.11772/j.issn.1001-9081.2025060700

《计算机应用》唯一官方网站

• • 下一篇

基于冗余特征抑制的轻量级人体姿态估计网络

吕超,马歌谣

长春理工大学

收稿日期:2025-06-24 修回日期:2025-09-05 发布日期:2025-09-17 出版日期:2025-09-17
通讯作者: 吕超
基金资助:
吉林省自然科学基金;国家重点研发计划

Lightweight human pose estimation network based on redundant feature suppression

Received:2025-06-24 Revised:2025-09-05 Online:2025-09-17 Published:2025-09-17

摘要/Abstract

摘要： 针对现有人体姿态估计网络在复杂场景下难以兼顾计算效率与定位精度的问题，提出一种基于冗余特征抑制的轻量级人体姿态估计网络。将其命名为LE-SHNet（Lightweight Enhanced Stacked Hourglass Network）。首先，在沙漏模块中设计多重分离沙漏模块（MSHM），通过异构卷积分支差异化建模大关节与末端肢体特征，并有效抑制冗余计算；其次，在沙漏模块之间引入混洗高效通道注意力（SECA），融合通道混洗与自适应卷积，以零参数量强化跨层级关节点关联；最后，在非沙漏模块中构建空间通道感知模块（SCPM），利用空间通道重构与三重注意力机制增强关键区域的感知能力。实验结果表明，该网络在MPII（Max Planck Institute for Informatics）和COCO2017（Common Objects in COntext 2017）数据集上分别达到88.7%和71.3%精度，较基线网络2-SHNet(2 Stacked Hourglass Network)在参数量上减少49.3%，计算量降低28.2%，精度提升1.1个百分点。与2024和2025年提出的轻量级人体姿态估计网络EL-HRNet（Efficient and Lightweight High-Resolution Network）和MobileMultiPose（Mobile-friendly and Multi-feature aggregation Pose estimation）相比，LE-SHNet的精度提升1.0和0.8个百分点，同时参数量减少32.0%和26.7%。LE-SHNet在保持轻量化的同时提升了关键点定位精度，具有在边缘设备实时部署中的潜在应用价值，可广泛用于智能监控、人机交互及运动康复等场景。

关键词: 计算机视觉, 人体姿态估计, 多重分离沙漏模块, 混洗高效通道注意力, 空间通道感知模块, 冗余特征抑制, 多尺度特征融合

Abstract: A lightweight human pose estimation network based on redundant feature suppression was proposed to address the difficulty of balancing computational efficiency and localization accuracy in complex scenarios. It was termed LE-SHNet(Lightweight Enhanced Stacked Hourglass Network). First, the Multiple Separated Hourglass Module (MSHM) was designed to employ heterogeneous convolution branches for differential modeling of large joints and distal limbs, while suppressing redundant computations. Then, the Shuffle Efficient Channel Attention (SECA) was integrated between hourglass modules, which combines channel shuffling and adaptive kernel convolution to enhance long-range joint correlations with zero additional parameters. Finally, the Spatial and Channel Perception Module (SCPM) was constructed in non-hourglass modules to strengthen spatial attention and channel responses by introducing spatial-channel reconstruction and triplet attention. Experimental results show that LE-SHNet achieves accuracy scores of 88.7% on Max Planck Institute for Informatics （MPII）and 71.3% on Common Objects in COntext 2017(COCO2017), reducing parameters by 49.3% and computational cost by 28.2% compared with the baseline 2 Stacked Hourglass Network (2-SHNet). Compared with the lightweight human pose estimation networks proposed in 2024 and 2025, namely EL-HRNet (Efficient and Lightweight High-Resolution Network) and MobileMultiPose (Mobile-friendly and Multi-feature aggregation Pose estimation), LE-SHNet achieves accuracy improvements of 1.0 and 0.8 percentage points, while reducing the number of parameters by 32.0% and 26.7%, respectively. These findings indicate that LE-SHNet maintains lightweight properties while significantly improving keypoint localization accuracy, making it suitable for real-time deployment on edge devices with promising applications in intelligent surveillance, human–computer interaction, and rehabilitation monitoring.

Key words: computer vision, human pose estimation, Multiple Separated Hourglass Module(MSHM), Shuffle Efficient Channel Attention(SECA), Spatial and Channel Perception Module(SCPM), redundant feature suppression, Multi-scale feature fusion

中图分类号:

中图分类号:TP391.41

吕超马歌谣. 基于冗余特征抑制的轻量级人体姿态估计网络[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2025060700.

[1]	梁一鸣, 范菁, 柴汶泽. 基于双向交叉注意力的多尺度特征融合情感分类[J]. 《计算机应用》唯一官方网站, 2025, 45(9): 2773-2782.
[2]	陈亮, 王璇, 雷坤. 复杂场景下跨层多尺度特征融合的安全帽佩戴检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(7): 2333-2341.
[3]	王向, 崔倩倩, 张晓明, 王建超, 王震洲, 宋佳霖. 改进ConvNeXt的无线胶囊内镜图像分类模型[J]. 《计算机应用》唯一官方网站, 2025, 45(6): 2016-2024.
[4]	陈子和, 陈斌. 基于多表征融合的无监督点云异常检测[J]. 《计算机应用》唯一官方网站, 2025, 45(5): 1677-1685.
[5]	郭诗月, 党建武, 王阳萍, 雍玖. 结合注意力机制和多尺度特征融合的三维手部姿态估计[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1293-1299.
[6]	张众维, 王俊, 刘树东, 王志恒. 多尺度特征融合与加权框融合的遥感图像目标检测[J]. 《计算机应用》唯一官方网站, 2025, 45(2): 633-639.
[7]	曾正东, 赵明. 基于图注意力机制的三维人体姿态估计时空上下文网络[J]. 《计算机应用》唯一官方网站, 2025, 45(10): 3161-3169.
[8]	杨建锋, 陈斌, 李雨轩. 基于点云重构的自监督点云异常检测方法[J]. 《计算机应用》唯一官方网站, 2025, 45(10): 3302-3310.
[9]	李卓然, 李华, 王桐, 蒋朝哲. 基于融合特征状态空间模型的轻量化人体姿态估计[J]. 《计算机应用》唯一官方网站, 2025, 45(10): 3179-3186.
[10]	尹学辉, 傅林琳, 周尚波. 渐进式上下文交互和注意力机制的混凝土路面裂缝检测网络[J]. 《计算机应用》唯一官方网站, 2025, 45(10): 3353-3362.
[11]	陈俊颖, 郭士杰, 陈玲玲. 基于解耦注意力与幻影卷积的轻量级人体姿态估计[J]. 《计算机应用》唯一官方网站, 2025, 45(1): 223-233.
[12]	刘赏, 周煜炜, 代娆, 董林芳, 刘猛. 融合注意力和上下文信息的遥感图像小目标检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(1): 292-300.
[13]	宋鹏程, 郭立君, 张荣. 利用局部-全局时间依赖的弱监督视频异常检测[J]. 《计算机应用》唯一官方网站, 2025, 45(1): 240-246.
[14]	潘烨新, 杨哲. 基于多级特征双向融合的小目标检测优化模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2871-2877.
[15]	付帅, 郭小英, 白茹意, 闫涛, 陈斌. 改进的CloFormer模型与有序回归相结合的年龄评估方法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2372-2380.

基于冗余特征抑制的轻量级人体姿态估计网络

Lightweight human pose estimation network based on redundant feature suppression

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics