计算机应用 ›› 2015, Vol. 35 ›› Issue (8): 2360-2365.DOI: 10.11772/j.issn.1001-9081.2015.08.2360

• 数据技术 • 上一篇    下一篇

基于可变容差关系的变精度粗糙集模型

郑树梅, 续欣莹, 谢珺, 闫高伟   

  1. 太原理工大学 信息工程学院, 太原 030024
  • 收稿日期:2015-03-01 修回日期:2015-04-09 出版日期:2015-08-10 发布日期:2015-08-14
  • 通讯作者: 续欣莹(1979-),男,山西定襄人,副教授,博士,CCF会员,主要研究方向:粒计算、大数据分析、智能控制,xuxinying@tyut.edu.cn
  • 作者简介:郑树梅(1989-),女,山西忻州人,硕士研究生,主要研究方向:粒计算、粗糙集; 谢珺(1979-),女,山西五台人,副教授,博士,主要研究方向:粒计算、粗糙集、数据挖掘、机器学习; 阎高伟(1970-),男,山西太原人,教授,博士,主要研究方向:智能信息处理、多传感器信息融合。
  • 基金资助:

    国家自然科学基金资助项目(61450011);山西省自然科学基金资助项目(2014011018-2);山西省回国留学人员科研资助项目(2013-033);山西省留学回国人员科技活动择优资助项目。

Variable precision rough set model based on variable-precision tolerance relation

ZHENG Shumei, XU Xinying, XIE Jun, YAN Gaowei   

  1. College of Information Engineering, Taiyuan University of Technology, Taiyuan Shanxi 030024, China
  • Received:2015-03-01 Revised:2015-04-09 Online:2015-08-10 Published:2015-08-14

摘要:

针对已有不完备信息系统扩展粗糙集模型对噪声鲁棒性差的局限性,首先分析了调节基本知识粒大小的同时引入相对错误分类度的必要性;然后结合系统属性值的缺失定义了对象联系度权值矩阵,并以此为基础提出了基于可变容差关系的变精度粗糙集模型(VPRS-VPTR);接着讨论了模型的性质,分析了模型中相关参数(基本知识粒大小、相对错误分类度)对分类精度的影响,给出了分类精度随模型中相关参数变化的求解算法与时间复杂度分析;最后通过仿真实验与相关研究的扩展粗糙集模型进行对比。仿真结果显示,VPRS-VPTR分类精度更高,而且针对UCI数据库上的几组不完备数据集进行仿真实验的结果还表明,相同参数下各不完备数据集的测试集和训练集分类精度变化趋势相同,进而验证了模型的有效性、灵活性及所提算法的可行性。

关键词: 粗糙集, 可变容差关系, 知识粒度, 变精度, 分类精度

Abstract:

Focusing on the underdeveloped robustness when the existing extended rough set model encounters the noise for the incomplete information system, the necessity of adjusting the size of basic knowledge granule as well as introducing the relative degree of misclassification was analyzed. Then the Variable Precision Rough Set model based on Variable-Precision Tolerance Relation (VPRS-VPTR) was established on the basis of the object connection weight matrix, which was proposed according to the lack probability of system attribute value. Moreover, the properties of the VPRS-VPTR model were discussed, the classification accuracy under the basic knowledge granule size and the relative degree of misclassification was analyzed, the corresponding algorithm was depicted and the time complexity analysis was given afterwards. The experimental results show that the VPRS-VPTR model has higher classification accuracy compared with some other research about the expanded rough set, and the change trend of the classification accuracy is similar for the train set and the test set of several groups of incomplete data sets in UCI database. It proves that the proposed model is more precise and flexible, and the algorithm is feasible and effective.

Key words: rough set, variable tolerance relation, knowledge granule, variable precision, classification accuracy

中图分类号: