《计算机应用》唯一官方网站 ›› 2025, Vol. 45 ›› Issue (2): 403-410.DOI: 10.11772/j.issn.1001-9081.2024010070

• 人工智能 • 上一篇    

小样本相似性匹配特征增强的密集目标计数网络

谢斌红1, 高婉银1(), 陆望东2, 张英俊1, 张睿1   

  1. 1.太原科技大学 计算机科学与技术学院,太原 030024
    2.山西天河云计算有限公司,山西 吕梁 033000
  • 收稿日期:2024-01-22 修回日期:2024-03-27 接受日期:2024-03-29 发布日期:2024-05-09 出版日期:2025-02-10
  • 通讯作者: 高婉银
  • 作者简介:谢斌红(1971—),男,山西运城人,教授,硕士,CCF会员,主要研究方向:智能化软件工程、机器学习
    陆望东(1970—),男,上海人,高级工程师,硕士,主要研究方向:信号与信息系统
    张英俊(1969—),男,山西运城人,教授级高级工程师,硕士,主要研究方向:智能感知决策
    张睿(1987—),男,山西太原人,副教授,博士,主要研究方向:智能化软件工程。
  • 基金资助:
    山西省基础研究计划项目(20210302123216);吕梁市引进高层次科技人才重点研发项目(2022RC08)

Dense object counting network with few-shot similarity matching feature enhancement

Binhong XIE1, Wanyin GAO1(), Wangdong LU2, Yingjun ZHANG1, Rui ZHANG1   

  1. 1.College of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
    2.Shanxi Tianhe Cloud Computing Company Limited,Lvliang Shanxi 033000,China
  • Received:2024-01-22 Revised:2024-03-27 Accepted:2024-03-29 Online:2024-05-09 Published:2025-02-10
  • Contact: Wanyin GAO
  • About author:XIE Binhong, born in 1971, M. S., professor. His research interests include intelligent software engineering, machine learning.
    LU Wangdong, born in 1970, M. S., senior engineer. His research interests include signal and information system.
    ZHANG Yingjun, born in 1969, M. S., professor-level senior engineer. His research interests include intelligent perception decision-making.
    ZHANG Rui, born in 1987, Ph. D., associate professor. His research interests include intelligent software engineering.
  • Supported by:
    Basic Research Program of Shanxi Province(20210302123216);Key Research and Development Project of Lvliang City for High-Level Scientific and Technological Talents Introduction(2022RC08)

摘要:

为了解决训练数据有限且类别多的问题,引入小样本学习方法。针对现有密集目标计数方法中存在的密集物体边界不清晰、空间不一致性和模型泛化能力弱等问题,提出一种小样本相似性匹配特征增强密集目标计数网络(SMFENet)。首先,通过特征提取模块提取图像特征,并使用ROI Align方法对齐样例特征;其次,设计相似性比较特征增强模块(SCFEM)计算样例特征和图像特征的相似度,得到相似度图,并将该图作为加权系数用样例特征自适应地增强图像特征,使最终得到的增强特征更关注与样例特征相似的区域;同时,采用内部特征增强、内部尺度增强以及信息合并等方法解决密集物体边界不清晰和空间不一致性问题;最后,利用密度预测模块生成密度图。此外,采用内容感知标注法生成高质量Ground-Truth密度图,以进一步提升模型的准确性。测试时,通过自适应损失调整网络使网络泛化到新类别上。在FSC-147数据集和CARPK数据集上的实验结果表明,与现有的小样本目标计数方法相比,所提模型的平均绝对误差(MAE)降低到13.82,均方根误差(RMSE)降低到45.91;与特定类别计数方法相比,所提模型的MAE降低到4.16,RMSE降低到5.91。以上充分证明SMFENet模型在提高计数的准确性和鲁棒性等方面能取得较好的效果,展示了该模型的实际应用价值。

关键词: 密集目标计数, 小样本学习, 密度预测, 相似性匹配特征增强

Abstract:

In order to address the challenges of limited training data and diverse categories, a few-shot learning method was introduced. In view of the problems existing in dense object counting methods, such as unclear boundaries and spatial inconsistency of dense objects as well as weak generalization capability of model, a few-shot Similarity Matching Feature Enhancement dense object counting Network (SMFENet) was proposed. Firstly, image features were extracted through the feature extraction module, and sample features were aligned using ROI Align. Secondly, a Similarity Comparison Feature Enhancement Module (SCFEM) was designed to calculate similarity between sample features and image features, resulting in a similarity graph. This graph was used as weighting coefficients to enhance the image features adaptively with the sample features, so as to obtain the final enhanced features focusing more on regions with features similar to the sample features. At the same time, methods such as internal feature enhancement, internal scale enhancement and information fusion were employed to solve the problems of unclear boundaries and spatial inconsistency of dense objects. Finally, a density map was generated using the density prediction module. Additionally, the content-aware annotation method was used to generate high-quality Ground-Truth density maps to further improve the model accuracy. During test, the network was adjusted by adaptive loss to generalize to new categories. Experimental results on FSC-147 dataset and CARPK dataset show that compared with the existing few-shot counting methods, the proposed model has the Mean Absolute Error (MAE) reduced to 13.82 and Root Mean Squared Error (RMSE) reduced to 45.91, compared with class-specific counting method, the proposed model has the MAE reduced to 4.16 and RMSE reduced to 5.91. The above fully proves that SMFENet model can achieve good results in improving the accuracy and robustness of counting, demonstrates the practical application value of the model.

Key words: dense object counting, few-shot learning, density prediction, similarity matching feature enhancement

中图分类号: