《计算机应用》唯一官方网站

• •    下一篇

基于正态逆伽马分布的多尺度融合人群计数算法

李伟1,张晓蓉1,陈鹏1,李清2,张长青3   

  1. 1. 中国电子科技集团公司第二十八研究所
    2. 天津大学
    3. 天津大学计算机科学与技术学院
  • 收稿日期:2023-06-19 修回日期:2023-08-24 发布日期:2023-09-04 出版日期:2023-09-04
  • 通讯作者: 李清

Multi-scale fusion crowd counting algorithm based on normal inverse Gamma distribution

  • Received:2023-06-19 Revised:2023-08-24 Online:2023-09-04 Published:2023-09-04

摘要: 针对人群分析任务中往往存在的因监控与人群距离不同而导致的尺度变化大的问题,提出一种可靠性多尺度融合人群计数框架。首先,该框架在使用传统骨架提取公共特征后,通过多尺度信息提取模块获得图片中不同尺度的行人信息;其次,每个尺度的网络各自包含一个人群密度估计模块和一个用于评估每个尺度预测结果可信度的不确定估计模块;然后,多尺度预测融合模块依据可信度对多尺度预测结果进行动态融合以获得更准确的密度回归结果。实验结果表明,现有方法在通过多尺度可信融合扩展后,人群计数的误差显著降低,验证了融合策略的合理性和有效性。除此之外,与现有方法不同,该框架不仅可以预测人群密度,还可以在部署阶段提供预测的可信程度,从而使得算法在实际应用中能及时预警出模型预测不准确的区域,降低后续分析任务出现错误预判的风险。

关键词: 人群计数, 多尺度, 可信融合, 人群密度估计, 不确定性

Abstract: To solve the problem of large variation caused by different distances between monitoring camera and crowd in the crowd analysis tasks, a crowd counting framework based on trusted multi-scale fusion is proposed. Firstly, the framework extracts the common features with the traditional skeleton and then obtains the pedestrian information of different scales with the multi-scale information extraction module; Secondly, each scale network contains a population density estimation module and an uncertainty estimation module for evaluating the reliability of the prediction results of each scale; Then, the multi-scale prediction fusion module dynamically fuses the multi-scale prediction results based on the reliability to obtain more accurate density regression results. The experimental results show that the error of crowd counting is significantly reduced after the expansion of the existing method by multi-scale trusted fusion, which verifies the necessity, extensibility, rationality and effectiveness of the fusion strategy. In addition, different from the existing methods, the framework can not only predict the population density, but also provide the reliability of the prediction during the deployment stage, so that the algorithm can timely warn the inaccurate areas predicted by the model in practical application, and reduce the risk of avoiding the wrong prediction in subsequent analysis tasks.

Key words: crowd counting, multi-scale, trustworthy fusion, crowd density estimation, uncertainty

中图分类号: