计算机应用 ›› 2015, Vol. 35 ›› Issue (8): 2366-2370.DOI: 10.11772/j.issn.1001-9081.2015.08.2366

• 数据技术 • 上一篇    下一篇

新的基于区分对象集的邻域粗糙集属性约简算法

梁海龙, 谢珺, 续欣莹, 任密蜂   

  1. 太原理工大学 信息工程学院, 太原 030024
  • 收稿日期:2015-03-09 修回日期:2015-04-27 出版日期:2015-08-10 发布日期:2015-08-14
  • 通讯作者: 谢珺(1979-),女,山西五台人,副教授,博士,CCF会员,主要研究方向:粒计算、粗糙集、数据挖掘、机器学习,xiejun@tyut.edu.cn
  • 作者简介:梁海龙(1988-),男,山东邹城人,硕士研究生, CCF会员,主要研究方向:粗糙集、数据挖掘; 续欣莹(1979-),男,山西定襄人,副教授,博士, CCF会员,主要研究方向:粒计算、大数据分析、智能控制; 任密蜂(1985-),女,河北平山人,讲师,博士,主要研究方向:随机分布控制、故障诊断、控制系统性能评估。
  • 基金资助:

    山西省自然科学基金资助项目(2014011018-2);山西省回国留学人员科研资助项目(2013-033);山西省留学回国人员科技活动择优资助项目;太原理工大学校基金青年项目(2014QN015)。

New attribute reduction algorithm of neighborhood rough set based on distinguished object set

LIANG Hailong, XIE Jun, XU Xinying, REN Mifeng   

  1. College of Information Engineering, Taiyuan University of Technology, Taiyuan Shanxi 030024, China
  • Received:2015-03-09 Revised:2015-04-27 Online:2015-08-10 Published:2015-08-14

摘要:

基于正域的属性约简算法是利用"下近似"思想,仅考虑被正确区分样本数的约简算法。借鉴"上近似"的思想,利用"邻域信息粒"的概念定义了区分对象集,探讨了其基本性质,并提出了基于区分对象集的属性重要度度量及启发式属性约简算法。该约简算法既考虑信息决策表的相对正域,也考虑以核属性为启发信息逐个增加条件属性时对边界域样本的影响。通过实例分析,说明了所提算法的可行性,并且以6个UCI标准数据集为实验对象,与基于正域的属性约简算法进行对比实验。实验结果说明,采用提出的约简算法得到的约简属性集,与基于正域的属性约简算法相比,在进行分类任务时的分类精度能够保持不变或有所提高。

关键词: 属性约简, 属性重要度, 相对正域, 邻域粗糙集, 分类精度

Abstract:

Since the algorithm of attribute reduction based on positive region is based on the thought of lower approximation, it just considers the right distinguished samples. Using the thought of upper approximation and the concept of neighborhood information granule, the distinguished object set with its basic characteristics was designed and analyzed, then the new attribute importance measurement based on distinguished object set and heuristic attribute reduction algorithm was proposed. The proposed algorithm considered both the relative positive region of information decision table and the influence on boundary samples when growing condition attributes. The feasibility of the algorithm was discussed by instance analysis, and the comparative experiments on UCI data set with attribute reduction algorithm based on positive region were carried out. The experimental results show that the proposed attribute reduction algorithm can get better reduction, and the classification precision of sample set can remain the same or has certain improvement.

Key words: attribute reduction, attribute importance, relative positive region, neighborhood rough set, classification precision

中图分类号: