Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (11): 3364-3370.DOI: 10.11772/j.issn.1001-9081.2023111616

• Artificial intelligence • Previous Articles     Next Articles

Meta label correction method based on shallow network predictions

Yuxin HUANG1, Yiwang HUANG1,2(), Hui HUANG3   

  1. 1.School of Computer Science and Mathematics,Fujian University of Technology,Fuzhou Fujian 350118,China
    2.School of Data Science,Tongren University,Tongren Guizhou 554300,China
    3.Department of Modern Agricultural Technology,Fujian Vocational College of Agriculture,Fuzhou Fujian 350119,China
  • Received:2023-11-22 Revised:2024-02-22 Accepted:2024-03-08 Online:2024-03-12 Published:2024-11-10
  • Contact: Yiwang HUANG
  • About author:HUANG Yuxin, born in 1998, M. S. candidate. His research interests include noisy label learning, knowledge distillation.
    HUANG Hui, born in 1992, M. S., teaching assistant. His research interests include machine learning, automated monitoring and control of basin water quality.
  • Supported by:
    National Natural Science Foundation of China(62066040);Project of Tongren Munipal Science and Technology Bureau (Tongren City Scientific Research [2022]5)

基于浅层网络预测的元标签校正方法

黄雨鑫1, 黄贻望1,2(), 黄辉3   

  1. 1.福建理工大学 计算机科学与数学学院,福州 350118
    2.铜仁学院 大数据学院,贵州 铜仁 554300
    3.福建农业职业技术学院 现代农业技术学院,福州 350119
  • 通讯作者: 黄贻望
  • 作者简介:黄雨鑫(1998—),男,福建永泰人,硕士研究生,CCF会员,主要研究方向:噪声标签学习、知识蒸馏
    黄辉(1992—),男,福建永泰人,助教,硕士,主要研究方向:机器学习、流域水质自动化监测与控制。
  • 基金资助:
    国家自然科学基金资助项目(62066040);铜仁市科技局资助项目(铜仁市科研[2022]5号)

Abstract:

Aiming at overfitting problem caused by memory behavior of Deep Neural Networks (DNNs) on image data with noisy labels, a meta label correction method based on predictions from shallow neural networks was proposed. In this method, with the use of weakly supervised training method, a label reweighting network was set to reweight noise data, meta learning method was employed to facilitate dynamic learning of the model to noise data, and the prediction output from both deep and shallow networks was used as the pseudo labels to train the model. At the same time, the knowledge distillation algorithm was applied to allow the deep network to guide the training of the shallow networks. In this way, the memory behavior of the model was alleviated effectively and the robustness of the model was enhanced. Experiments conducted on CIFAR10/100 and Clothing1M datasets demonstrate the superiority of the proposed method over Meta Label Correction (MLC) method. Particularly, on CIFAR10 dataset with symmetrical noise ratios of 60% and 80%, the accuracy improvements are 3.49 and 1.56 percentage points respectively. Furthermore, in ablation experiments on CIFAR100 dataset with asymmetric noise ratio of 40%, at most 5.32 percentage points accuracy improvement is achieved by the proposed method over models trained without predicted labels, confirming the feasibility and effectiveness of the proposed method.

Key words: noisy label, meta learning, label correction, label reweighting, knowledge distillation

摘要:

针对深度神经网络(DNN)对含有噪声标签的图像数据具有记忆行为而导致的过拟合问题,提出一种基于浅层神经网络预测的元标签校正方法。该方法采用弱监督训练方式,通过设置标签重加权网络对噪声数据进行加权操作,利用元学习方法使模型动态地学习噪声数据,并将模型中深层与浅层网络的预测输出作为伪标签训练模型,同时利用知识蒸馏算法使深层网络指导浅层网络训练,以有效缓解模型的记忆行为并提升模型鲁棒性。在CIFAR10/100、Clothing1M数据集上的实验结果表明,相较于元标签校正(MLC)方法,所提方法在对称噪声比例为60%与80%的CIFAR10数据集上的准确率分别提升了3.49、1.56个百分点;此外,在CIFAR100数据集的消融实验中,非对称噪声比例为40%时,所提方法比无预测标签训练的模型准确率最高提升了5.32个百分点,验证了所提方法的可行性与有效性。

关键词: 噪声标签, 元学习, 标签校正, 标签重加权, 知识蒸馏

CLC Number: