计算机应用

• 数据库技术(Database technology) • 上一篇    下一篇

广义粗糙集理论及实值属性约简

肖迪 张军峰   

  1. 南京工业大学 自动化学院 南京航空航天大学 自动化学院
  • 收稿日期:2007-12-21 修回日期:2008-02-02 发布日期:2008-06-01 出版日期:2008-06-01
  • 通讯作者: 肖迪

Generalization rough set theory and real-valued attributes reduction

Di XIAO Jun-feng ZHANG   

  • Received:2007-12-21 Revised:2008-02-02 Online:2008-06-01 Published:2008-06-01
  • Contact: Di XIAO

摘要: 针对经典粗糙集理论仅能处理离散化数据的局限性,提出属性和属性子集的广义重要度的概念以及空间中的广义近邻关系,并提出了广义近邻关系下的广义粗糙集扩展模型。广义粗糙集理论利用广义近邻关系在全局中划分相容模块,构成集合的下、上近似集,避免了经典粗糙集理论必须量化数据的麻烦。另外,提出了广义粗糙集的实值属性约简的一种贪心算法,并分析了约简属性集合的质量。最后通过实例验证了所提方法的正确性和有效性。

关键词: 数据挖掘, 广义粗糙集理论, 广义重要度, 近似约简

Abstract: Considering that the classical rough set theory can only process the discrete data, the degree of general importance of an attribute and attribute subsets was presented. And then a generalization rough set theory was proposed based on the general near neighborhood relation. The theory partitioned the universe into the tolerant modules and formed lower approximation and upper approximation of the set under general near neighborhood relationship which avoided the discretization in Pawlak's rough set. Furthermore, the definition of attribute reduction in generalization rough set and its greedy algorithm were proposed. Finally, results of some examples show the correctness and validity of this method.

Key words: data mining, general rough set theory, degree of general importance, approximation reduction