面向深度分类模型超参数自优化的代理模型

doi:10.11772/j.issn.1001-9081.2023091313

《计算机应用》唯一官方网站

• • 下一篇

面向深度分类模型超参数自优化的代理模型

张睿¹,潘俊铭¹,白晓露²,胡静¹,张荣国¹,张鹏云¹

1.太原科技大学计算机科学与技术学院 2.北京工业大学信息学部计算机学院

收稿日期:2023-09-25 修回日期:2024-03-07 发布日期:2024-04-01 出版日期:2024-04-01
通讯作者: 张睿
作者简介:张睿(1987—)，男，山西太原人，副教授，博士，CCF高级会员，主要研究方向：自动机器学习、智能信息处理；潘俊铭(1996—),男，山西太原人，硕士研究生，主要研究方向：智能信息处理；白晓露(1999—)，男，山西大同人，博士研究生，主要研究方向：智能信息处理；胡静(1977—)，女，山西大同人，教授，博士，CCF会员，主要研究方向：深度学习、图像处理；张荣国(1964—)，男，山西太原人，教授，博士，CCF高级会员，主要研究方向：图像处理、计算机视觉、模式识别；张鹏云(1999—)，男，山西太原人，硕士研究生，主要研究方向：智能信息处理。
基金资助:
教育部人文社会科学研究项目（23YJCZH299）；山西省基础研究计划项目（20210302123216、202203021211189）；太原科技大学研究生联合培养示范基地项目（JD2022004）；太原科技大学研究生教育创新项目（SY2023040）

Agent model for hyperparameter self-optimization of deep classification model

ZHANG Rui¹, PAN Junming¹, BAI Xiaolu², HU Jing¹, ZHANG Rongguo¹, ZHANG Pengyun¹

1. School of Computer Science and Technology, Taiyuan University of Science and Technology 2. School of Computer Science, Faculty of Information Science, Beijing University of Technology

Received:2023-09-25 Revised:2024-03-07 Online:2024-04-01 Published:2024-04-01
About author:ZHANG Rui, born in 1987, Ph. D., associate professor. His research interests include automatic machine learning, intelligent information processing. PAN Junming, born in 1996, M.S. candidate. His research interests include automatic machine learning, intelligent information processing. BAI Xiaolu, born in 1999, Ph. D. candidate. His research interests include intelligent information processing. HU Jing, born in 1977, Ph. D., professor. Her research interests include image segmentation and recognition, intelligent optimization algorithms. ZHANG Rongguo, born in 1964, Ph. D., professor. His research interests include image processing, computer vision, pattern recognition. ZHANG Pengyun, born in 1999, M.S. candidate. His research interests include automatic machine learning, intelligent information processing.
Supported by:
Humanities and Social Sciences research project of Ministry of Education（23YJCZH299），Shanxi Basic Research Program (20210302 123216,202203021211189), Taiyuan University of Science and Technology Graduate Joint Training Demonstration Base Project (JD2022004), Taiyuan University of Science and Technology Graduate Education Innovation Project (SY2023040)

摘要/Abstract

摘要： 为进一步提高深度分类模型超参数多目标自适应寻优效率，提出一种筛选式增强Dropout代理模型（Filter Enhanced Dropout Agent model, FEDA）。首先，构建点对互信息约束增强的双通道Dropout神经网络，增强对高维超参数深度分类模型的拟合，并结合聚集选解策略加速候选解集的选取；其次，设计一种结合模型管理策略的算法FEDA-ARMOEA均衡种群个体的收敛性和多样性,协助FEDA提高深度分类模型训练及超参数自优化效率。将FEDA-ARMOEA与EDN-ARMOEA、HeE-MOREA等算法进行对比实验，实验结果表明，FEDA-ARMOEA在41组测试问题上表现较好。在工业应用焊缝数据集MTF和公共数据集CIFAR-10上实验，FEDA-ARMOEA优化的分类模型在MTF数据集和CIFAR-10数据集上的精度分别达到96.16%和93.79%，训练时间相对对比算法分别平均提高了34.29%和25.55%，均优于对比算法，验证了所提算法的有效性和泛化性。

关键词: 深度卷积神经网络, 分类模型, 超参数优化, 代理模型, 模型优化

Abstract: To further improve the efficiency of hyperparameter multi-objective adaptive optimization of deep classification models, a Filter Enhanced Dropout Agent model （FEDA）was proposed. Firstly, a dual-channel Dropout neural network with enhanced point-to-point mutual information constraint was constructed, to enhance the fitting of high-dimensional hyperparameter deep classification model, and the selection of candidate solution sets was accelerated by combining the aggregation solution selection strategy. Second, an algorithm FEDA-ARMOEA combined with model management strategies was designed to balance the convergence and diversity of individual populations, and to assist FEDA in improving the efficiency of deep classification model training and hyperparameter self optimization. Comparative experiments were conducted between FEDA-ARMOEA, EDN-ARMOEA, HeE-MOREA, and other algorithms. Experimental results show that FEDA-ARMOEA performs well on 41 sets of testing problems. Experiments on industrial application weld data set MTF and public data set CIFAR-10 show that the accuracy of FEDA-ARMOEA optimized classification model on MTF and CIFAR-10 data set is 96.16% and 93.79%, respectively, and the training time is decreased by an average of 34.29% and 25.55% compared with the contrast algorithms, respectively. All of them are superior to the contrast algorithms', which verifies the effectiveness and generalization of the proposed algorithm.

Key words: deep convolutional neural network, classification model, hyperparameter optimization, agent model, model optimization

中图分类号:

TP183

张睿潘俊铭白晓露胡静张荣国张鹏云. 面向深度分类模型超参数自优化的代理模型[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2023091313.

ZHANG Rui, PAN Junming, BAI Xiaolu, HU Jing, ZHANG Rongguo, ZHANG Pengyun. Agent model for hyperparameter self-optimization of deep classification model[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2023091313.

[1]	佘维, 李阳, 钟李红, 孔德锋, 田钊. 基于改进实数编码遗传算法的神经网络超参数优化[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 671-676.
[2]	梁捷, 郝晓燕, 陈永乐. 面向视觉分类模型的投毒攻击[J]. 《计算机应用》唯一官方网站, 2023, 43(2): 467-473.
[3]	刘万军, 王佳铭, 曲海成, 董利兵, 曹欣宇. 基于频谱空间域特征注意的音乐流派分类算法[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2072-2077.
[4]	朱文球, 邹广, 曾志高. 融合层次特征和混合注意力的目标跟踪算法[J]. 《计算机应用》唯一官方网站, 2022, 42(3): 833-843.
[5]	戴朝霞, 曹堉栋, 朱光明, 沈沛意, 徐旭, 梅林, 张亮. 基于知识蒸馏的特定知识学习[J]. 《计算机应用》唯一官方网站, 2021, 41(12): 3426-3431.
[6]	薛锋, 史旭华, 史非凡. 基于代理模型的差分进化约束优化[J]. 计算机应用, 2020, 40(4): 1091-1096.
[7]	郑宗生, 胡晨雨, 姜晓轶. 基于改进的最大均值差异算法的深度迁移适配网络[J]. 计算机应用, 2020, 40(11): 3107-3112.
[8]	欧阳宁, 梁婷, 林乐平. 基于自注意力网络的图像超分辨率重建[J]. 计算机应用, 2019, 39(8): 2391-2395.
[9]	邓忠豪, 陈晓东. 基于深度卷积神经网络的肺结节检测算法[J]. 计算机应用, 2019, 39(7): 2109-2115.
[10]	何新宇, 张晓龙. 基于深度神经网络的肺炎图像识别模型[J]. 计算机应用, 2019, 39(6): 1680-1684.
[11]	邵长城, 陈平华. 融合社交网络和图像内容的兴趣点推荐[J]. 计算机应用, 2019, 39(5): 1261-1268.
[12]	胡秀华, 王长元, 肖锋, 王亚文. 利用空间结构信息的相关滤波目标跟踪算法[J]. 计算机应用, 2019, 39(4): 1150-1156.
[13]	卫鑫, 武淑红, 王耀力. 基于深度卷积长短期记忆网络的森林火灾烟雾检测模型[J]. 计算机应用, 2019, 39(10): 2883-2887.
[14]	王柯力, 袁红春. 基于迁移学习的水产动物图像识别方法[J]. 计算机应用, 2018, 38(5): 1304-1308.
[15]	洪睿, 康晓东, 郭军, 李博, 王亚鸽, 张秀芳. 基于复杂网络描述的图像深度卷积分类方法[J]. 计算机应用, 2018, 38(12): 3399-3402.

面向深度分类模型超参数自优化的代理模型

Agent model for hyperparameter self-optimization of deep classification model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics