《计算机应用》唯一官方网站

• •    下一篇

基于元学习的标签噪声自适应学习算法

齐巧玲1,王啸啸1,张茜茜1,汪鹏2,董永峰1   

  1. 1. 河北工业大学
    2. 河北工业大学 人工智能与数据科学学院
  • 收稿日期:2024-07-05 修回日期:2024-10-15 发布日期:2024-11-19 出版日期:2024-11-19
  • 通讯作者: 王啸啸
  • 基金资助:
    国家自然科学基金;河北省高等教育教学改革研究与实践项目

Label noise adaptive learning algorithm based on meta-learning

  • Received:2024-07-05 Revised:2024-10-15 Online:2024-11-19 Published:2024-11-19

摘要: 随着计算机视觉领域的不断进步,图像分类技术日益成熟,并逐渐用于各个领域。图像分类需要收集大量的图片进行模型训练与优化,但图片的收集过程不可避免地带来噪声标签。为了应对这一挑战,鲁棒性分类方法应运而生。目前的鲁棒性分类方法中超参数的设置需要手动调节,对人力物力带来了大量的损耗。因此,提出了元超参数调节器MHA(Meta Hyperparameter Adjuster),采用双层嵌套循环优化的方法自适应学习噪声感知的超参数组合,并称为Meta-FPL算法(Pseudo-label adaptive learning algorithm based on meta learning)。此外,为了解决元训练阶段反向传播过程耗费GPU大量算力的问题,提出了选择激活元模型层策略SAML(Select the activation metamodel layer strategy),通过比较虚拟训练阶段反向传播的平均梯度与元梯度的大小,限制部分元模型层的更新,有效提升了模型的训练效率。在四个基准数据集和一个真实数据集上分别进行了实验,实验结果表明Meta-FPL算法的分类准确率较高,且在元训练阶段的反向传播过程训练时长缩短了79.52%,说明Meta-FPL算法能在较短训练时间内有效提升分类测试准确率。

关键词: 深度学习, 深度神经网络, 图像分类, 标签噪声, 元学习

Abstract: With the continuous progress in the field of computer vision, image classification techniques are becoming more and more mature and are gradually used in various fields. Image classification requires the collection of a large number of images for model training and optimization, but the image collection process inevitably brings noisy labels. To cope with this challenge, robust classification methods have emerged. However, the setting of hyperparameters in the current robust classification methods needs to be adjusted manually, which brings a lot of loss in human and material resources. Therefore, in this paper propose Meta Hyperparameter Adjuster (MHA), which adopts a two-layer nested loop optimization method to adaptively learn noise-aware hyperparameter combinations, and is called Pseudo-label adaptive learning algorithm which based on meta learning (, Meta-FPL). In addition, in order to solve the problem that the backpropagation process in the meta training phase consumes a large amount of GPU arithmetic, the Select the activation metamodel layer strategy (SAML) is proposed, which restricts part of the metamodel layer by comparing the magnitude of the average gradient of the backpropagation and the meta-gradient in the virtual training phase, and effectively improves the model training. updates, which effectively improves the training efficiency of the model. Experiments were conducted on four benchmark datasets and one real dataset respectively, and the experimental results show that the Meta-FPL algorithm has a higher classification accuracy and the training time of the backpropagation process in the meta-training phase is shortened by 79.52%, which suggests that the Meta-FPL algorithm can effectively improve the classification test accuracy in a shorter training time.

Key words: deep learning, deep neural networks, image classification, label noise, meta-learning