《计算机应用》唯一官方网站

• •    下一篇

面向深度分类模型超参数自优化的代理模型

张睿1,潘俊铭1,白晓露2,胡静1,张荣国1,张鹏云1   

  1. 1.太原科技大学 计算机科学与技术学院 2.北京工业大学 信息学部计算机学院
  • 收稿日期:2023-09-25 修回日期:2024-03-07 发布日期:2024-04-01 出版日期:2024-04-01
  • 通讯作者: 张睿
  • 作者简介:张睿(1987—),男,山西太原人,副教授,博士,CCF高级会员,主要研究方向:自动机器学习、智能信息处理;潘俊铭(1996—),男,山西太原人,硕士研究生,主要研究方向:智能信息处理;白晓露(1999—),男,山西大同人,博士研究生,主要研究方向:智能信息处理;胡静(1977—),女,山西大同人,教授,博士,CCF会员,主要研究方向:深度学习、图像处理;张荣国(1964—),男,山西太原人,教授,博士,CCF高级会员,主要研究方向:图像处理、计算机视觉、模式识别;张鹏云(1999—),男,山西太原人,硕士研究生,主要研究方向:智能信息处理。
  • 基金资助:
    教育部人文社会科学研究项目(23YJCZH299);山西省基础研究计划项目(20210302123216、202203021211189);太原科技大学研究生联合培养示范基地项目(JD2022004);太原科技大学研究生教育创新项目(SY2023040)

Agent model for hyperparameter self-optimization of deep classification model

ZHANG Rui1, PAN Junming1, BAI Xiaolu2, HU Jing1, ZHANG Rongguo1, ZHANG Pengyun1   

  1. 1. School of Computer Science and Technology, Taiyuan University of Science and Technology 2. School of Computer Science, Faculty of Information Science, Beijing University of Technology
  • Received:2023-09-25 Revised:2024-03-07 Online:2024-04-01 Published:2024-04-01
  • About author:ZHANG Rui, born in 1987, Ph. D., associate professor. His research interests include automatic machine learning, intelligent information processing. PAN Junming, born in 1996, M.S. candidate. His research interests include automatic machine learning, intelligent information processing. BAI Xiaolu, born in 1999, Ph. D. candidate. His research interests include intelligent information processing. HU Jing, born in 1977, Ph. D., professor. Her research interests include image segmentation and recognition, intelligent optimization algorithms. ZHANG Rongguo, born in 1964, Ph. D., professor. His research interests include image processing, computer vision, pattern recognition. ZHANG Pengyun, born in 1999, M.S. candidate. His research interests include automatic machine learning, intelligent information processing.
  • Supported by:
    Humanities and Social Sciences research project of Ministry of Education(23YJCZH299),Shanxi Basic Research Program (20210302 123216,202203021211189), Taiyuan University of Science and Technology Graduate Joint Training Demonstration Base Project (JD2022004),  Taiyuan University of Science and Technology Graduate Education Innovation Project (SY2023040)

摘要: 为进一步提高深度分类模型超参数多目标自适应寻优效率,提出一种筛选式增强Dropout代理模型(Filter Enhanced Dropout Agent model, FEDA)。首先,构建点对互信息约束增强的双通道Dropout神经网络,增强对高维超参数深度分类模型的拟合,并结合聚集选解策略加速候选解集的选取;其次,设计一种结合模型管理策略的算法FEDA-ARMOEA均衡种群个体的收敛性和多样性,协助FEDA提高深度分类模型训练及超参数自优化效率。将FEDA-ARMOEA与EDN-ARMOEA、HeE-MOREA等算法进行对比实验,实验结果表明,FEDA-ARMOEA在41组测试问题上表现较好。在工业应用焊缝数据集MTF和公共数据集CIFAR-10上实验,FEDA-ARMOEA优化的分类模型在MTF数据集和CIFAR-10数据集上的精度分别达到96.16%和93.79%,训练时间相对对比算法分别平均提高了34.29%和25.55%,均优于对比算法,验证了所提算法的有效性和泛化性。

关键词: 深度卷积神经网络, 分类模型, 超参数优化, 代理模型, 模型优化

Abstract: To further improve the efficiency of hyperparameter multi-objective adaptive optimization of deep classification models, a Filter Enhanced Dropout Agent model (FEDA)was proposed. Firstly, a dual-channel Dropout neural network with enhanced point-to-point mutual information constraint was constructed, to enhance the fitting of high-dimensional hyperparameter deep classification model, and the selection of candidate solution sets was accelerated by combining the aggregation solution selection strategy. Second, an algorithm FEDA-ARMOEA combined with model management strategies was designed to balance the convergence and diversity of individual populations, and to assist FEDA in improving the efficiency of deep classification model training and hyperparameter self optimization. Comparative experiments were conducted between FEDA-ARMOEA, EDN-ARMOEA, HeE-MOREA, and other algorithms. Experimental results show that FEDA-ARMOEA performs well on 41 sets of testing problems. Experiments on industrial application weld data set MTF and public data set CIFAR-10 show that the accuracy of FEDA-ARMOEA optimized classification model on MTF and CIFAR-10 data set is 96.16% and 93.79%, respectively, and the training time is decreased by an average of 34.29% and 25.55% compared with the contrast algorithms, respectively. All of them are superior to the contrast algorithms', which verifies the effectiveness and generalization of the proposed algorithm.

Key words: deep convolutional neural network, classification model, hyperparameter optimization, agent model, model optimization

中图分类号: