《计算机应用》唯一官方网站 ›› 2025, Vol. 45 ›› Issue (7): 2195-2202.DOI: 10.11772/j.issn.1001-9081.2024060862

• 人工智能 • 上一篇    下一篇

开放生成与特征优化的开集识别方法

向尔康1, 黄荣1,2, 董爱华1,2()   

  1. 1.东华大学 信息科学与技术学院,上海 201620
    2.数字化纺织服装技术教育部工程研究中心(东华大学),上海 201620
  • 收稿日期:2024-06-25 修回日期:2024-09-14 接受日期:2024-09-18 发布日期:2025-07-10 出版日期:2025-07-10
  • 通讯作者: 董爱华
  • 作者简介:向尔康(2000—),男(土家族),湖南湘西人,硕士研究生,主要研究方向:图像识别、开集识别
    黄荣(1985—),男,浙江绍兴人,副教授,博士,主要研究方向:图像理解、开集识别
    董爱华(1970—),女,上海嘉定人,副教授,博士,主要研究方向:智能纺织服装。dongaihua@dhu.edu.cn
  • 基金资助:
    国家自然科学基金资助项目(62001099);中央高校基本科研业务费专项资金资助项目(2232023D-30)

Open set recognition method with open generation and feature optimization

Erkang XIANG1, Rong HUANG1,2, Aihua DONG1,2()   

  1. 1.College of Information Science and Technology (Donghua University),Shanghai 201620,China
    2.Engineering Research Center of Digitized Textile and Apparel Technology,Ministry of Education (Donghua University),Shanghai 201620,China
  • Received:2024-06-25 Revised:2024-09-14 Accepted:2024-09-18 Online:2025-07-10 Published:2025-07-10
  • Contact: Aihua DONG
  • About author:XIANG Erkang, born in 2000, M. S. candidate. His research interests include image recognition, open set recognition.
    HUANG Rong, born in 1985, Ph. D., associate professor. His research interests include image understanding, open set recognition.
    DONG Aihua, born in 1970, Ph. D., associate professor. Her research interests include smart textiles and clothing.
  • Supported by:
    National Natural Science Foundation of China(62001099);Fundamental Research Funds for Central Universities(2232023D-30)

摘要:

当深度神经网络(DNN)遇到训练时未遇见的类别的样本时,不能准确地拒绝未知类样本,而开集识别能在准确分类已知类样本同时拒绝未知类样本。目前在开集识别领域,原型学习方法广为应用,然而这些方法都无法同时保证样本分布内的紧凑性和样本分布间的分离性。因此,提出开放生成与特征优化的开集识别方法(OGFO)。首先,提出开放点的概念,原型点通过DNN学习对应类别样本的固有特征而开放点是各类别原型点的均值。开放点代表未知类的固有特征且占据特征空间的中心区域。特征空间中心区域为未知类样本分布的开放空间;其次,提出基于开放点的特征优化算法(FOA),从而利用开放点强迫相同类别样本内部的分布更加紧凑并且迫使不同类别样本间的分布更加分离;最后,提出基于开放点的生成方法OGAN(Open Generative Adversarial Network),并使用DNN迫使OGAN生成的未知类样本分布在开放点占据的开放空间中。实验结果表明,相较于基于对抗性反向点学习的开集识别方法(ARPL),OGFO在MNIST、SVHN、CIFAR10和TinyImageNet数据集上的AUROC(Area Under the Receiver Operating Characteristic curve)提升明显,尤其在TinyImageNet数据集上的AUROC上至少提升了3个百分点,在准确率和OSCR(Open Set Classification Rate)上分别至少提升6和5个百分点。可见,OGFO解决了其他方法无法兼顾样本分布内的紧凑性和样本分布间的分离性的问题。

关键词: 特征优化, 开集识别, 开放点, 原型学习, 深度神经网络, 生成器

Abstract:

Deep Neural Network (DNN) may encounter samples that are not present during training, and there are difficulties in rejecting unknown class samples accurately. Open set recognition can classify known class samples accurately while rejecting unknown class samples. In the current domain of open set recognition, prototype learning methods have gained widespread applications. However, these methods cannot ensure intra-class compactness and inter-class separability of sample distribution simultaneously. Therefore, an open set recognition method with Open Generation and Feature Optimization (OGFO) was proposed. Firstly, the concept of open point was presented, and the inherent features of corresponding class samples were learned by prototype points through DNN. The average of prototype points for each class was open point, representing the inherent features of unknown classes, and the central region of the feature space was occupied by the open point as open space for the distribution of unknown class samples. Then, a Feature Optimization Algorithm based on open points (FOA) was proposed, so that open points were utilized to force a more compact distribution within the same class samples, and a more separable distribution among different class samples. Finally, an open point based generation method OGAN (Open Generative Adversarial Network) was proposed, and the unknown class samples generated by OGAN were forced by DNN to distribute in the open space occupied by open points. Experimental results demonstrate that compared with Adversarial Reciprocal Points Learning for open set recognition (ARPL) on datasets such as MNIST, SVHN, CIFAR10, and TinyImageNet, OGFO has significant improvements in AUROC (Area Under the Receiver Operating Characteristic curve). Especially, on TinyImageNet dataset, OGFO improves the AUROC by at least 3 percentage points and has improvements of at least 6 and at least 5 percentage points, respectively, in accuracy and OSCR (Open Set Classification Rate) compared to ARPL. It can be seen that the challenge which other methods cannot address: balancing intra-distribution compactness and inter-distribution separability of sample distribution simultaneously is addressed by OGFO.

Key words: feature optimization, open set recognition, open point, prototype learning, Deep Neural Network (DNN), generator

中图分类号: