Multi-side covering algorithm based on feature selection

Journal of Computer Applications, 2011, Vol. 31, Issue (05): 1318-1320. DOI: 10.3724/SP.J.1087.2011.01318

WU Tao^1,2, ZHANG Fang-fang^2

1. Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei, Anhui 230039, China
2. School of Mathematical Sciences, Anhui University, Hefei, Anhui 230039, China

Received: 2010-11-17 Revised: 2011-01-05 Online: 2011-05-01 Published: 2011-05-01
Corresponding author: ZHANG Fang-fang
About the authors: WU Tao (1970-), male, born in Taihe, Anhui, professor, Ph.D.; main research interests: machine learning, intelligent computing. ZHANG Fang-fang (1986-), female, born in Mengcheng, Anhui, master's student; main research interests: machine learning, intelligent computing.
Funding:
National Natural Science Foundation of China (60675031); National 973 Program of China (2007BC311003); Provincial Natural Science Research Project of Anhui Higher Education Institutions (KJ2008B093); Anhui University Innovation Team (KJTD001B); Anhui University Talent Team Construction Fund.

Abstract

Abstract: The multi-side covering algorithm applies a divide-and-conquer strategy to the classification of massive high-dimensional data. Based on the sum of the absolute values of the component differences, it selects subsets of attributes and constructs coverings for different subsets of the training samples, which reduces the complexity of learning. However, the initial attribute set has to be chosen by experience or by experiment. To reduce the subjectivity of choosing the initial attribute set and the complexity of adjusting it, the Relief feature selection method was used to determine the optimal feature subset for each data set, a hierarchical covering network was constructed, and experiments were carried out on real data sets. The experimental results show that the algorithm achieves relatively high accuracy and efficiency and can effectively classify complex problems.

Key words: covering algorithm, feature selection, multi-side progression
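The abstract names Relief as the feature selection step but does not give its details. The following is a minimal sketch of a common textbook form of Relief for two-class data, not the authors' implementation. It assumes that every sample is used once (instead of random sampling), that features are scaled by their value range, and that neighbours are found with the sum of absolute component differences. The function names `l1_distance`, `relief_weights` and `select_features`, and the toy data, are hypothetical.

```python
def l1_distance(a, b):
    """Sum of absolute component differences between two vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def relief_weights(samples, labels):
    """Return one Relief relevance weight per feature (illustrative sketch)."""
    n, d = len(samples), len(samples[0])
    # Per-feature value range, used to scale each feature to unit spread.
    spans = []
    for j in range(d):
        column = [s[j] for s in samples]
        spans.append((max(column) - min(column)) or 1.0)
    scaled = [[s[j] / spans[j] for j in range(d)] for s in samples]

    weights = [0.0] * d
    for i, x in enumerate(scaled):
        # Candidates from the same class (hits) and from other classes (misses).
        hits = [s for k, s in enumerate(scaled) if k != i and labels[k] == labels[i]]
        misses = [s for k, s in enumerate(scaled) if labels[k] != labels[i]]
        if not hits or not misses:
            continue
        near_hit = min(hits, key=lambda s: l1_distance(x, s))
        near_miss = min(misses, key=lambda s: l1_distance(x, s))
        # Reward features that differ on the nearest miss,
        # penalise features that differ on the nearest hit.
        for j in range(d):
            weights[j] += (abs(x[j] - near_miss[j]) - abs(x[j] - near_hit[j])) / n
    return weights


def select_features(weights, k):
    """Indices of the k highest-weighted features, best first."""
    return sorted(range(len(weights)), key=lambda j: weights[j], reverse=True)[:k]


# Hypothetical toy data: feature 0 separates the classes, feature 1 is noise.
samples = [[0.0, 0.3], [0.1, 0.9], [0.2, 0.1], [0.8, 0.2], [0.9, 0.8], [1.0, 0.4]]
labels = [0, 0, 0, 1, 1, 1]
weights = relief_weights(samples, labels)
selected = select_features(weights, 1)
print(weights, selected)
```

On this toy data the class-separating feature receives the larger weight and is the one selected. How many features to keep (`k` here) is a free parameter of the sketch; the abstract does not state how the paper fixes it.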

WU Tao, ZHANG Fang-fang. Multi-side covering algorithm based on feature selection[J]. Journal of Computer Applications, 2011, 31(05): 1318-1320.

