Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1981-1988.DOI: 10.11772/j.issn.1001-9081.2025060705

• Multimedia computing and computer simulation • Previous Articles    

Human dimension attention regressor method for monocular occluded human mesh recovery

Menghua WANG1,2, Yukun DONG1,2(), Long CHENG1,2, Junqi SUN1,2   

  1. 1.Qingdao Institute of Software,College of Computer Science and Technology,China University of Petroleum (East China),Qingdao Shandong 266580,China
    2.Shandong Key Laboratory of Intelligent Oil and Gas Industrial Software (China University of Petroleum (East China)),Qingdao Shandong 266580,China
  • Received:2025-06-24 Revised:2025-08-31 Accepted:2025-09-02 Online:2025-09-09 Published:2026-06-10
  • Contact: Yukun DONG
  • About author:WANG Menghua, born in 2001, M. S. candidate. His research interests include machine vision, human mesh recovery.
    CHENG Long, born in 2000, M. S. candidate. His research interests include deep learning, machine vision.
    SUN Junqi, born in 2002, M. S. candidate. His research interests include deep learning, machine vision.
    First author contact:DONG Yukun, born in 1981, Ph. D., associate professor. His research interests include deep learning, software testing.
  • Supported by:
    Shandong Provincial National Science Foundation(ZR2024MF129)

用于单目遮挡人体网格恢复的人体尺寸注意力回归方法

王梦华1,2, 董玉坤1,2(), 程龙1,2, 孙骏骐1,2   

  1. 1.中国石油大学(华东) 青岛软件学院、计算机科学与技术学院,山东 青岛 266580
    2.山东省智能油气工业软件重点实验室(中国石油大学(华东)),山东 青岛 266580
  • 通讯作者: 董玉坤
  • 作者简介:王梦华(2001—),男,河南郑州人,硕士研究生,CCF会员,主要研究方向:机器视觉、人体网格重建
    程龙(2000—),男,山东烟台人,硕士研究生,主要研究方向:深度学习、机器视觉
    孙骏骐(2002—),男,安徽淮南人,硕士研究生,主要研究方向:深度学习、机器视觉。
    第一联系人:董玉坤(1981—),男,山东济宁人,副教授,博士,CCF会员,主要研究方向:深度学习、软件测试
  • 基金资助:
    山东省自然科学基金资助项目(ZR2024MF129);山东省自然科学基金资助项目(ZR2023MF041)

Abstract:

In real-world scenarios, human images are often occluded by clothing, self-posture, and environmental objects, leading to insufficient visible information, so that the existing human reconstruction methods tend to degrade to mean models in shape modelling, failing to recover real individual characteristics faithfully. To address this issue, a Human Dimension Attention Regressor (HDAR) method for monocular occluded human mesh recovery was proposed. Firstly, human dimensions in the visible region were used to infer the dimensions of occluded parts. Secondly, a hierarchical proportion constraint was introduced, in which first-level constraints were applied to adjacent body parts and second-level constraints were applied to distant body parts, thereby ensuring that the regressed shapes conform to human structural characteristics. Finally, Two-Dimensional (2D) joint information was integrated with the body dimension information for iterative optimization, so as to improve pose estimation accuracy. Experimental results on the 3DPW (Three-Dimensional (3D) Poses in the Wild) dataset show that, the proposed method achieves a Per Vertex Error (PVE) of 65.2 mm, which is 10.7 mm lower than that of Multi-HMR (Multi-person whole-body Human Mesh Recovery) under occlusion conditions, corresponding to a 14.1% error reduction. Visualization experimental results demonstrate that the proposed method improves the reconstruction accuracy of human shape and pose in complex occlusion scenarios effectively.

Key words: monocular occluded human reconstruction, human dimension, human measurement, occluded human, hierarchical proportion constraint, iterative optimization

摘要:

在真实场景中,人体图像常受服装、自身姿态及环境物体的遮挡,导致可见信息不足,使现有人体重建方法在形状建模上易退化为均值模型,难以真实还原个体真实特征。针对这一问题,提出一种用于单目遮挡人体网格恢复的人体尺寸注意力回归方法(HDAR)。首先,利用可见区域的人体尺寸推理被遮挡部分的尺寸信息;其次,引入人体维度的分级比例约束,在邻近部位间建立一级约束,在较远部位间建立二级约束,使回归形状符合人体结构特征;最后,结合二维(2D)关节点信息与人体尺寸进行迭代优化,提升姿态估计精度。在3DPW(Three-Dimensional Poses in the Wild)数据集上的实验结果表明,该方法的逐顶点误差(PVE)为65.2 mm,相较于Multi-HMR(Multi-person whole-body Human Mesh Recovery)在遮挡状态下减小了10.7 mm,即减小了14.1%的误差。可视化实验的结果表明,所提方法能够在复杂遮挡场景下有效提升人体形状与姿态的重建精度。

关键词: 单目遮挡人体重建, 人体尺寸, 人体测量, 遮挡人体, 分级比例约束, 迭代优化

CLC Number: