Journal of Computer Applications ›› 2021, Vol. 41 ›› Issue (5): 1290-1298.DOI: 10.11772/j.issn.1001-9081.2020081192

Special Issue: 人工智能

• Artificial intelligence • Previous Articles     Next Articles

Simultaneous feature selection optimization based on improved spotted hyena optimizer algorithm

JIA Heming1,2, JIANG Zichao2, LI Yao2, SUN Kangjian2   

  1. 1. School of Information Engineering, Sanming University, Sanming Fujian 365004, China;
    2. College of Mechanical and Electrical Engineering, Northeast Forestry University, Harbin Heilongjiang 150040, China
  • Received:2020-08-10 Revised:2020-10-18 Online:2021-05-10 Published:2020-11-25
  • Supported by:
    This work is partially supported by the Project of Ministry of Education Industry-University Cooperation Collaborative Education (202002064014),the Educational Research Project of Young and Middle-aged Teachers in Fujian Province (JAT200618), the Guiding Science and Technology Project in Sanming City (2020-G-61), the Scientific Research Start Fund for Sanming University Introduced High-Level Talents (20YG14), the Scientific Research and Development Fund of Sanming University (B202009), the Project of Sanming University Higher Education Research (SHE2013), the Open Research Fund of Fujian Provincial Key Laboratory of Agriculture Internet of Things Application (ZD2101)

基于改进斑点鬣狗优化算法的同步优化特征选择

贾鹤鸣1,2, 姜子超2, 李瑶2, 孙康健2   

  1. 1. 三明学院 信息工程学院, 福建 三明 365004;
    2. 东北林业大学 机电工程学院, 哈尔滨 150040
  • 通讯作者: 贾鹤鸣
  • 作者简介:贾鹤鸣(1983-),男,辽宁辽阳人,教授,博士,CCF会员,主要研究方向:群体智能优化、特征选择、多阈值图像分割;姜子超(1995-),男,黑龙江齐齐哈尔人,硕士研究生,主要研究方向:机器学习、群智能优化、特征选择;李瑶(1997-),女,黑龙江伊春人,硕士研究生,主要研究方向:群智能优化、特征选择;孙康健(1996-),男,辽宁锦州人,硕士研究生,主要研究方向:群智能优化、特征选择。
  • 基金资助:
    教育部产学合作协同育人项目(202002064014);福建省教育厅中青年教师教育科研项目(JAT200618);三明市科技计划引导性项目(2020-G-61);三明学院引进高层次人才科研启动经费支持项目(20YG14);三明学院科学研究发展基金资助项目(B202009);三明学院高教研究课题(SHE2013);福建省农业物联网应用重点实验室开放研究基金资助项目(ZD2101)。

Abstract: Aiming at the disadvantages of traditional Support Vector Machine (SVM) in the wrapper feature selection:low classification accuracy, redundant feature subset selection and poor computational efficiency, the meta-heuristic optimization algorithm was used to simultaneously optimize SVM and feature selection. In order to improve the classification effect of SVM and the ability of feature subset selection, firstly, the Spotted Hyena Optimizer (SHO) algorithm was improved by using the adaptive Differential Evolution (DE) algorithm, chaotic initialization and tournament selection strategy, so as to enhance its local search ability as well as improve its optimization efficiency and solution accuracy; secondly, the improved algorithm was applied to the simultaneous optimization of feature selection and SVM parameter adjustment; finally, a feature selection simulation experiment was carried out on the UCI datasets, and the classification accuracy, the number of selected features, the fitness value and the running time were used to comprehensively evaluate the optimization performance of the proposed algorithm. Experimental results show that the simultaneous optimization mechanism of the improved algorithm can reduce the number of selected features with high classification accuracy, and compared to the traditional algorithms, this algorithm is more suitable for solving the problem of wrapper feature selection, which has good application value.

Key words: Spotted Hyena Optimizer (SHO) algorithm, Differential Evolution (DE), chaotic initialization, tournament selection, Support Vector Machine (SVM), wrapper feature selection

摘要: 针对传统支持向量机(SVM)在封装式特征选择中分类精度低、特征子集选择冗余以及计算效率差的不足,利用元启发式优化算法同步优化SVM与特征选择。为改善SVM分类效果以及选择特征子集的能力,首先,利用自适应差分进化(DE)算法、混沌初始化与锦标赛选择策略对斑点鬣狗优化(SHO)算法改进,以增强其局部搜索能力并提高其寻优效率与求解精度;其次,将改进后的算法用于特征选择与SVM参数调整的同步优化中;最后,在UCI数据集进行特征选择仿真实验,采取分类准确率、选择特征数、适应度值及运行时间来综合评估所提算法的优化性能。实验结果证明,改进算法的同步优化机制能够在高分类准确率下降低特征选择的数目,该算法比传统算法更适合解决封装式特征选择问题,具有良好的应用价值。

关键词: 斑点鬣狗优化算法, 差分进化, 混沌初始化, 锦标赛选择, 支持向量机, 封装式特征选择

CLC Number: