Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1989-1997.DOI: 10.11772/j.issn.1001-9081.2025050663

• Multimedia computing and computer simulation • Previous Articles    

YOLO-AirPose: human pose estimation algorithm in UAV aerial view

Qiuyan YIN1, Jing DING2(), Zhigang NIE1   

  1. 1.College of Information Science and Technology,Gansu Agricultural University,Lanzhou Gansu 730070,China
    2.Department of Physical Education,Gansu Agricultural University,Lanzhou Gansu 730070,China
  • Received:2025-06-23 Revised:2025-09-13 Accepted:2025-09-18 Online:2025-09-25 Published:2026-06-10
  • Contact: Jing DING
  • About author:YIN Qiuyan, born in 1998, M. S. candidate. Her research interests include pose recognition, object detection.
    NIE Zhigang, born in 1980, Ph. D., professor. His research interests include computer vision, smart agriculture.
    First author contact:DING Jing, born in 1979, M. S., associate professor. Her research interests include sports training and rehabilitation, human posture correction.
  • Supported by:
    Youth Mentor Support Program of Gansu Agricultural University(GAU-QDFC-2022-19);Top-notch Talent Program of Gansu Province(GSBJLJ-2023-09)

无人机航拍视角下的人体姿态估计算法YOLO-AirPose

尹秋燕1, 丁婧2(), 聂志刚1   

  1. 1.甘肃农业大学 信息科学技术学院,兰州 730070
    2.甘肃农业大学 体育教学部,兰州 730070
  • 通讯作者: 丁婧
  • 作者简介:尹秋燕(1998—),女,山东聊城人,硕士研究生,CCF会员,主要研究方向:姿态识别、目标检测
    聂志刚(1980—),男,甘肃张掖人,教授,博士,主要研究方向:计算机视觉、智慧农业。
    第一联系人:丁婧(1979—),女,陕西咸阳人,副教授,硕士,主要研究方向:运动训练与康复、人体姿态矫正
  • 基金资助:
    甘肃农业大学青年导师扶持项目(GAU-QDFC-2022-19);甘肃省拔尖领军人才项目(GSBJLJ-2023-09)

Abstract:

To address the challenges of background interference, keypoint localization deviation, and target occlusion in Unmanned Aerial Vehicle (UAV) aerial view human pose estimation, an enhanced human pose estimation algorithm named YOLO-AirPose was proposed for non-ground view scenarios. Firstly, a symmetric flip augmentation strategy based on keypoint topology constraint, named IPSFA (Index-Preserved Symmetric Flip Augmentation), was designed to improve generalization under multi-view scenarios. Secondly, a C2BRA (C2 Bi-level Routing Attention) module was constructed by integrating BRA (Bi-level Routing Attention) mechanism to replace the original C2PSA (Cross stage Partial with Spatial Attention), thereby enhancing the model’s perception of small-scale targets and occluded keypoints. Thirdly, combining spatial modeling ability of Transformer, an AIFI (Adaptive Interaction Feature Integration) module was embedded into the backbone network, so that 2D positional encoding was combined to improve keypoint localization performance. Finally, a C3k2-DAttention module based on deformable attention mechanism was designed to strengthen the network’s global modeling and receptive field adjustment abilities. Experimental results show that YOLO-AirPose achieves improvements of 3.0, 5.0, 4.6, and 6.8 percentage points in precision of object detection and precision, recall, and mAP@0.5 of pose estimation compared to the baseline model YOLO-Pose, respectively, while maintaining low computational cost and parameter quantity. It can be seen that the proposed algorithm provides an improved solution to the accuracy limitations in UAV aerial view human pose estimation and enhances adaptability to complex human poses.

Key words: human pose estimation, Unmanned Aerial Vehicle (UAV), YOLOv11-Pose, object detection, region awareness

摘要:

针对无人机(UAV)航拍视角下人体姿态估计中存在的复杂背景干扰、关键点定位偏移和目标遮挡等问题,提出一种适用于非地面视角下的增强型人体姿态估计算法YOLO-AirPose。首先,设计基于关键点拓扑约束的对称翻转增强策略IPSFA (Index-Preserved Symmetric Flip Augmentation),提升多视角场景下的泛化能力;其次,融合BRA(Bi-level Routing Attention)机制构建C2BRA(C2 Bi-level Routing Attention)模块替代原有的C2PSA(Cross stage Partial with Spatial Attention),增强模型对小尺寸目标与遮挡关键点的表达能力;再次,结合Transformer的空间建模能力,将AIFI(Adaptive Interaction Feature Integration)模块嵌入主干网络,以结合2D位置编码优化关键点定位性能;最后,设计基于可变形注意力机制的C3k2-DAttention模块,以增强网络的全局建模与感受野调控能力。实验结果表明,在保持较低计算量和较低参数量的前提下,YOLO-AirPose在目标检测的精确率以及姿态估计的精确率、召回率和mAP@0.5上相较于基准模型YOLO-Pose分别提升了3.0以及5.0、4.6和6.8个百分点。可见,所提算法为UAV俯视视角下人体姿态估计精度不足问题提供了改进方案,同时还提升了对人体复杂姿态的适应能力。

关键词: 人体姿态估计, 无人机, YOLOv11-Pose, 目标检测, 区域感知

CLC Number: