《计算机应用》唯一官方网站

• •    下一篇

基于多尺度记忆库的像素级无监督工业异常检测

刘永江1,陈斌2   

  1. 1. 中国科学院成都计算机应用研究所
    2. 中国科学院大学;哈尔滨工业大学(深圳)
  • 收稿日期:2023-12-08 修回日期:2024-03-01 接受日期:2024-03-07 发布日期:2024-03-12 出版日期:2024-03-12
  • 通讯作者: 刘永江

Pixel-level Unsupervised Industrial Anomaly Detection Based on Multi-Scale Memory Bank

  • Received:2023-12-08 Revised:2024-03-01 Accepted:2024-03-07 Online:2024-03-12 Published:2024-03-12

摘要: 基于特征嵌入的无监督异常检测方法通常使用patch级特征来定位异常。尽管patch级特征在图像级异常检测任务上具有竞争力,但在像素级定位方面仍存在精度不足的问题。为解决这一问题,提出了MemAD,一种像素级异常检测方法,由多尺度记忆库与分割网络组成。首先,通过预训练的特征提取网络对训练集中的正常样本进行特征提取,构建三个尺度下的正样本特征记忆库。其次,在训练分割网络时,采用模拟的伪异常样本特征与记忆库中距离最近的正样本特征计算差特征,进一步引导分割网络学习如何定位异常像素。实验结果表明,MemAD在MVTec数据集上的图像级和像素级AUROC分别达到了0.98和0.974,优于大多数现有方法,证实了其在像素级异常定位中的准确性。该方法为无监督异常检测领域提供了一种新的思路和方法,具有广泛的应用前景。

关键词: 计算机视觉, 无监督异常检测, 特征嵌入, 记忆库, 语义分割

Abstract: Unsupervised anomaly detection methods based on feature embedding often use patch-level features to localize anomalies. Although patch-level features are competitive in image-level anomaly detection tasks, there is still a lack of accuracy in pixel-level localization. To address this issue, propose MemAD, a pixel-level anomaly detection method composed of a multi-scale memory bank and a segmentation network. First, a pre-trained feature extraction network is used to extract features from normal samples in the training set, constructing a feature memory bank at three different scales. Then, during the training of the segmentation network, simulated pseudo-anomaly sample features are used to calculate the difference features with the nearest normal sample features in the memory bank, further guiding the segmentation network to learn how to locate anomalous pixels. Experimental results show that MemAD achieves image-level and pixel-level AUROC scores of 0.98 and 0.974, respectively, on the MVTec dataset, outperforming most existing methods and confirming its accuracy in pixel-level anomaly localization. This method provides a new approach and technique for unsupervised anomaly detection and has a wide range of applications.

Key words: computer vision, unsupervised anomaly detection, feature embedding, memory bank, semantic segmentation

中图分类号: