Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (7): 2203-2210.DOI: 10.11772/j.issn.1001-9081.2024070944

• Artificial intelligence • Previous Articles     Next Articles

Gradient-discriminative and feature norm-driven open-world object detection

Yingjun ZHANG1, Weiwei YAN1(), Binhong XIE1, Rui ZHANG1, Wangdong LU2   

  1. 1.College of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
    2.Shanxi Tianhe Cloud Computing Company Limited,Lyuliang Shanxi 033000,China
  • Received:2024-07-08 Revised:2024-10-09 Accepted:2024-10-09 Online:2025-07-10 Published:2025-07-10
  • Contact: Weiwei YAN
  • About author:ZHANG Yingjun, born in 1969, M. S., professor of engineering. His research interests include intelligent software, software architecture.
    YAN Weiwei, born in 1999, M. S. candidate. Her research interests include open-world object detection.
    XIE Binhong, born in 1971, M. S., professor. His research interests include intelligent software, machine learning.
    ZHANG Rui, born in 1987, Ph. D., associate professor. His research interests include intelligent information processing.
    LU Wangdong, born in 1970, M. S., senior engineer. His research interests include signal and information system.
  • Supported by:
    Basic Research Program of Shanxi Province(20210302123216);High-Level Scientific and Technological Talents Introduction Key Research and Development Project of Lyuliang City(2022RC08)

梯度区分与特征范数驱动的开放世界目标检测

张英俊1, 闫薇薇1(), 谢斌红1, 张睿1, 陆望东2   

  1. 1.太原科技大学 计算机科学与技术学院,太原 030024
    2.山西天河云计算有限公司,山西 吕梁 033000
  • 通讯作者: 闫薇薇
  • 作者简介:张英俊(1969—),男,山西河津人,教授级高级工程师,硕士,主要研究方向:智能化软件、软件体系结构
    闫薇薇(1999—),女,山西晋城人,硕士研究生,主要研究方向:开放世界目标检测 yww1374670805@163.com
    谢斌红(1971—),男,山西万荣人,教授,硕士,主要研究方向:智能化软件、机器学习
    张睿(1987—),男,山西太原人,副教授,博士,主要研究方向:智能信息处理
    陆望东(1970—),男,山西吕梁人,高级工程师,硕士,主要研究方向:信号与信息系统。
  • 基金资助:
    山西省基础研究计划项目(20210302123216);吕梁市引进高层次科技人才重点研发项目(2022RC08)

Abstract:

Open-World Object Detection (OWOD) extends the object detection task to real and variable environments, and requires models to identify known and unknown objects accurately and learn new knowledge gradually. In response to the low recall for unknown classes and the problem of false identification in the existing OWOD methods, a Gradient-Discriminative and Feature Norm-driven OWOD (GDFN-OWOD) network model was proposed. To address the issue of low recall for unknown classes, a Gradient-Discriminative Representation Module (GDRM) was proposed, which uses the gradient difference from backpropagation to distinguish unknown classes from the background accurately, thereby improving the recall for unknown classes. In addition, a Graph Segmentation-based Bounding box Clustering (GSBC) algorithm was introduced to model the determination of object bounding boxes as a graph decomposition problem, thereby reducing redundant bounding boxes, and thus reducing the computational complexity of the model. To tackle the problem of false identification for unknown classes, a FeatureNorm-Based Classifier (FN-BC) was employed to select the best-performing convolutional layer to identity known and unknown classes for higher identification precision. Experimental results on M-OWODB dataset show that compared with the best performance of comparison models in tasks T1T2, and T3, GDFN-OWOD has the recall for unknown classes increased by 1.1, 2.1, and 0.9 percentage points, respectively, and the Absolute Open-Set Error (A-OSE) reduced by 35.1%, 28.7%, and 12.2%, respectively. It can be seen that compared with the existing OWOD methods, the proposed method alleviates the problems of low recall for unknown classes and false identification effectively.

Key words: Open-World Object Detection (OWOD), backpropagation gradient, graph segmentation algorithm, feature norm, Convolutional Neural Network (CNN)

摘要:

开放世界目标检测(OWOD)将目标检测任务拓展至真实多变的环境中,要求模型能准确识别已知和未知对象,并逐步学习新知识。针对现有OWOD网络模型中未知类的召回率偏低和误识别的问题,提出一种梯度区分与特征范数驱动的开放世界目标检测(GDFN-OWOD)网络模型。针对未知类召回率偏低的问题,提出梯度区分性表征模块(GDRM),即利用反向传播的梯度差异区分未知类别和背景,以提高未知类召回率;此外,引入基于图分割的框聚类(GSBC)算法将物体边界框的确定建模为图分解问题,从而减少冗余的边界框,进而降低模型的计算量;针对未知类误识别的问题,采用基于特征范数的分类器(FN-BC)选择性能最优的卷积层识别已知和未知类别,以达到更高的识别准确率。在M-OWODB数据集上的实验结果表明,与最优对比模型相比在T1T2T3任务中GDFN-OWOD的未知类召回率分别提升了1.1、2.1、0.9个百分点,而绝对开集误差(A-OSE)分别降低了35.1%、28.7%和12.2%。可见,与现有的OWOD网络模型相比,所提网络模型有效缓解了未知类的召回率偏低和误识别的问题。

关键词: 开放世界目标检测, 反向传播梯度, 图分割算法, 特征范数, 卷积神经网络

CLC Number: