Spatio-temporal modeling and hierarchical feature enhancement for person re-identification

doi:10.11772/j.issn.1001-9081.2025081053

Journal of Computer Applications

Received: 2025-09-11  Revised: 2025-12-18  Online: 2026-02-12  Published: 2026-02-12
Contact: Ding-Li YANG
Supported by: National Natural Science Foundation of China

Ding-Li YANG (杨定礼), Yuan-Fang WEI (卫元芳), Wen-Rui HU (胡文瑞), Li-Yang KONG (孔力杨), Yin-Shan YU (于银山)

Huaiyin Institute of Technology

Abstract

To address the difficulty of matching identities in person re-identification (ReID) under occlusion, viewpoint variation, and pose changes, this paper proposes a ReID algorithm that combines spatio-temporal modeling with hierarchical feature enhancement. The algorithm applies a three-stage progressive feature optimization framework that jointly improves global consistency and local discriminability. First, after the backbone network extracts appearance features, a double-pooling temporal attention mechanism is introduced. It combines global average pooling and temporal average pooling to capture complementary information from sequence features, and models spatio-temporal dependencies through interaction between the channel and spatial dimensions, which highlights motion-related features and mitigates the loss of local information caused by occlusion. Second, to handle the uneven distribution of body-part features, a flexible feature fusion module is designed. It adaptively aggregates multi-part features through learnable weights, suppressing occlusion-induced noise and strengthening discriminative local features to produce hierarchical global and local representations. Finally, a confidence calibration network based on residual learning is placed before the classification layer to refine the distribution of identity prediction confidence and improve cross-camera retrieval accuracy. On public datasets including Market-1501 and P-Duke-MTMC, the proposed method achieves mAP of 93.2% and 86.8% and Rank-1 accuracy of 97.4% and 95.2%, respectively. These results indicate that combining spatio-temporal modeling with hierarchical feature enhancement significantly improves ReID performance in complex scenarios.
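The abstract does not specify how the flexible feature fusion module forms its learnable weights, so the following is only a minimal sketch of one common way to aggregate part features adaptively: a softmax-normalized weighted sum. The function names `softmax` and `fuse_parts`, the one-scalar-logit-per-part parameterization, and the toy feature values are all assumptions made for illustration; they are not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_parts(part_features, part_logits):
    """Aggregate part feature vectors with softmax-normalized weights.

    part_features: list of P vectors, each a list of D floats.
    part_logits:   list of P floats standing in for learnable weights
                   (hypothetical; in a trained model these would be
                   updated by gradient descent or predicted per input).
    Returns the fused D-dimensional vector and the normalized weights.
    """
    if len(part_features) != len(part_logits):
        raise ValueError("need exactly one logit per part")
    weights = softmax(part_logits)
    dim = len(part_features[0])
    fused = [0.0] * dim
    for weight, feature in zip(weights, part_features):
        for d in range(dim):
            fused[d] += weight * feature[d]
    return fused, weights


# Three hypothetical body-part features (e.g. head, torso, legs), D = 2.
parts = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# A strongly negative logit nearly removes the third part from the sum,
# which is how such a scheme could down-weight an occluded region.
fused, weights = fuse_parts(parts, [0.0, 0.0, -10.0])
```

With equal logits the sketch reduces to a plain average over parts, so any benefit comes from the weights departing from uniform. How the paper's module actually computes or conditions its weights is not stated in the abstract.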

Key words: person re-identification, spatio-temporal modeling, flexible feature fusion, feature enhancement, occlusion

CLC Number: TP391.4

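Ding-Li YANG, Yuan-Fang WEI, Wen-Rui HU, Li-Yang KONG, Yin-Shan YU. Spatio-temporal modeling and hierarchical feature enhancement for person re-identification [J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025081053.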
