基于邻域关系模糊粗糙集的分类新方法

doi:10.11772/j.issn.1001-9081.2015.11.3116

计算机应用 ›› 2015, Vol. 35 ›› Issue (11): 3116-3121.DOI: 10.11772/j.issn.1001-9081.2015.11.3116

• 2015年全国开放式分布与并行计算学术年会(DPCS 2015)论文 • 上一篇下一篇

基于邻域关系模糊粗糙集的分类新方法

胡学伟, 蒋芸, 李志磊, 沈健, 华锋亮

西北师范大学计算机科学与工程学院, 兰州 730070

收稿日期:2015-06-17 修回日期:2015-07-08 发布日期:2015-11-13
通讯作者: 胡学伟(1991-),男,陕西咸阳人,硕士研究生,主要研究方向:数据挖掘、粗糙集.
作者简介:蒋芸(1970-),女,浙江绍兴人,教授,博士,CCF会员,主要研究方向:数据挖掘、粗糙集; 李志磊(1991-),女,河北衡水人,主要研究方向:数据挖掘、粗糙集; 沈健(1990-),男,安徽合肥人,硕士研究生,主要研究方向:数据挖掘、粗糙集; 华锋亮(1989-),女,陕西咸阳人,硕士研究生,主要研究方向:分布式与并行计算.
基金资助:
国家自然科学基金资助项目(61163036,61163039);甘肃省高等学校研究生导师科研基金资助项目(1201-16);西北师范大学第三期知识与创新工程科研骨干项目(nwnu-kjcxgc-03-67).

New classification method based on neighborhood relation fuzzy rough set

HU Xuewei, JIANG Yun, LI Zhilei, SHEN Jian, HUA Fengliang

College of Computer Science and Engineering, Northwest Normal University, Lanzhou Gansu 730070, China

Received:2015-06-17 Revised:2015-07-08 Published:2015-11-13

摘要/Abstract

摘要： 针对目前模糊等价关系所诱导的模糊粗糙集模型不能准确地反映模糊概念范畴中数值属性描述的决策问题,提出一种基于邻域关系的模糊粗糙集模型NR-FRS,给出了该粗糙集模型的相关定义,在讨论模型性质的基础上进行模糊化邻域近似空间上的推理,并分析特征子空间下的属性依赖性;最后在NR-FRS的基础上提出特征选择算法,构建使得模糊正域增益优于具体阈值的特征子集,进而剔除冗余特征,保留分类能力强的属性.采用UCI标准数据集进行分类实验,使用径向基核函数(RBF)支持向量机作为分类器.实验结果表明,同基于邻域粗糙集的快速前向特征选择方法以及核主成分分析方法(KPCA)相比,NR-FRS模型特征选择算法所得特征子集中特征数量依据参数变化更加平缓、稳定.同时平均分类准确率提升最好可以达到5.2%,且随特征选择参数呈现更加平稳的变化.

关键词: 粒化和逼近, 特征选择, 邻域关系, 属性依赖性

Abstract: Since fuzzy rough sets induced by fuzzy equivalence relations can not quite accurately reflect decision problems described by numerical attributes among fuzzy concept domain, a fuzzy rough set model based on neighborhood relation called NR-FRS was proposed. First of all, the definitions of the rough set model were presented. Based on properties of NR-FRS, a fuzzy neighborhood approximation space reasoning was carried out, and attribute dependency in characteristic subspace was also analyzed. Finally, feature selection algorithm based on NR-FRS was presented, and feature subsets was constructed next, which made fuzzy positive region greater than a specific threshold, thereby getting rid of redundant features and reserving attributes that have a strong capability in classification. Classification experiment was implemented on UCI standard data sets, which used Radial Basis Function (RBF) support vector machine as the classifier. The experimental results show that, compared with fast forward feature selection based on neighborhood rough set as well as Kernel Principal Component Analysis (KPCA), feature number of the subset obtained by NR-FRS model feature selection algorithm changes more smoothly and stably according to parameters. Meanwhile, average classification accuracy increases by 5.2% in the best case and varies stably according to parameters.

Key words: granulating and approximation, feature selection, neighborhood relation, attribute dependence

中图分类号:

TP181

胡学伟, 蒋芸, 李志磊, 沈健, 华锋亮. 基于邻域关系模糊粗糙集的分类新方法[J]. 计算机应用, 2015, 35(11): 3116-3121.

HU Xuewei, JIANG Yun, LI Zhilei, SHEN Jian, HUA Fengliang. New classification method based on neighborhood relation fuzzy rough set[J]. Journal of Computer Applications, 2015, 35(11): 3116-3121.

参考文献

[1] PAWLAK Z. Rough sets[J]. International Journal of Information and Computer Science,1982,11(5):129-141.
[2] HU Q,YU D. Application of rough calculation[M]. Beijing: Science Press,2012:16-97.(胡清华,于达仁.应用粗糙计算[M]. 北京:科学出版社,2012:16-97.)
[3] YAO Y. Three-way decisions with probabilistic rough sets[J]. Information Sciences,2010,180(3):341-353.
[4] YAO Y. The superiority of three-way decisions in probabilistic rough set models[J].Information Sciences, 2011,18(6):1080-1096.
[5] YAO Y. Two semantic issues in a probabilistic rough set model[J]. Fundamenta Informaticae,2011,108(3/4): 249-265.
[6] SHARMA R, JAIN P, SHRIVASTAVA K S, et al. An optimize decision tree algorithm based on variable precision rough set theory using degree of β -dependency and significance of attributes[J]. International Journal of Computer Science and Information Technologies, 2012,3(3):3942-3947.
[7] ZHANG W, WU W, LIANG J. Rough set theory and method[M]. Beijing: Science Press,2005:132-157.(张文修,吴伟志,梁吉业.粗糙集理论与方法[M]. 北京:科学出版社,2005:132-157.)
[8] WANG G, MA X, YU H. Monotonic uncertainty measures for attribute reduction in probabilistic rough set model[J]. International Journal of Approximate Reasoning, 2015,59(3): 41-67.
[9] JIA X, TANG Z, LIAO W, et al. On an optimization representation of decision-theoretic rough set model[J]. International Journal of Approximate Reasoning, 2014, 55(1): 156-166.
[10] NGUYEN X T, NGUYEN D D. Rough fuzzy relation on two universal sets[J]. International Journal of Intelligent Systems and Applications, 2014, 6(4): 49-52.
[11] CHEN D, YANG Y. Attribute reduction for heterogeneous data based on the combination of classical and fuzzy rough set models[J]. IEEE Transactions on Fuzzy Systems, 2014, 22(5): 1325-1334.
[12] LIN T Y,HUANG K J,LIU Q,et al. Rough sets, neighborhood systems and approximation[C]// ISMIS 1990: Proceedings of the 5th International Symposium on Methodologies of Intelligent System. Charlotte: Elsevier Science, 1990: 19-90.
[13] LINGRAS P, CHEN M, MIAO D. Qualitative and quantitative combinations of crisp and rough clustering schemes using dominance relations[J]. International Journal of Approximate Reasoning, 2014,55(1):238-258.
[14] HU Q,YU D, LIU J. Neighborhood rough set based heterogeneous feature subset selection[J].Information Science,2008,178(18):3577-3594.
[15] HU Q, YU D, XIE Z. Numerical attribute reduction based on neighborhood granulation and rough approximation[J]. Journal of Software,2008,19(3):640-649.(胡清华,于达仁,谢宗霞.基于邻域粒化和粗糙逼近的数值属性约简[J].软件学报,2008,19(3):640-649.)
[16] HU Q, YU D, XIE Z. Neighborhood classifiers[J]. Expert Systems with Applications, 2008,34(2): 866-876.
[17] FRANK A, ASUNCION A. UCI machine learning repository[DB/OL]. [2013-12-12]. http://archive.Ice.Uci.Edu/ml.
[18] HU Q, AN S, YU D. Soft fuzzy rough sets for robust feature evaluation and selection[J]. Information Sciences, 2010,180(22): 4384-4400.
[19] LIU Y, HUANG W, JIANG Y, et al. Quick attribute reduct algorithm for neighborhood rough set model[J].Information Sciences,2014,271(7):65-81.
[20] KUANG F, XU W, ZHANG S.A novel hybrid KPCA and SVM with GA model for intrusion detection[J].Applied Soft Computing,2014,18(4):178-184.
[21] MIAO D, LI D. Rough sets theory algorithms and applications[M].Beijing: Tsinghua University Press,2008:246-247.(苗夺谦,李道国.粗糙集理论、算法与应用[M]. 北京:清华大学出版社,2008:246-247.)

基于邻域关系模糊粗糙集的分类新方法

New classification method based on neighborhood relation fuzzy rough set

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	湛航, 何朗, 黄樟灿, 李华峰, 张蔷, 谈庆. 改进的基于层次距离的基因表达式编程特征选择分类算法[J]. 计算机应用, 2021, 41(9): 2658-2667.
[2]	祝承, 赵晓琦, 赵丽萍, 焦玉宏, 朱亚飞, 陈建英, 周伟, 谭颖. 基于谱聚类半监督特征选择的功能磁共振成像数据分类[J]. 计算机应用, 2021, 41(8): 2288-2293.
[3]	李蒙蒙, 秦伟, 刘艺, 刁兴春. 结合头脑风暴优化的混合蚁群优化算法[J]. 计算机应用, 2021, 41(8): 2412-2417.
[4]	林筠超, 万源. 基于图结构优化的自适应多度量非监督特征选择方法[J]. 计算机应用, 2021, 41(5): 1282-1289.
[5]	贾鹤鸣, 姜子超, 李瑶, 孙康健. 基于改进斑点鬣狗优化算法的同步优化特征选择[J]. 计算机应用, 2021, 41(5): 1290-1298.
[6]	张志浩, 林耀进, 卢舜, 郭晨, 王晨曦. 缺失标记下基于类属属性的多标记特征选择[J]. 计算机应用, 2021, 41(10): 2849-2857.
[7]	顾桐, 许国良, 李万林, 李家浩, 王志愿, 雒江涛. 基于集成LightGBM和贝叶斯优化策略的房价智能评估模型[J]. 计算机应用, 2020, 40(9): 2762-2767.
[8]	黄学雨, 徐浩特, 陶剑文. 具有特征选择的多源自适应分类框架[J]. 计算机应用, 2020, 40(9): 2499-2506.
[9]	刘丹, 姚立霜, 王云锋, 裴作飞. 面向类不平衡流量数据的分类模型[J]. 计算机应用, 2020, 40(8): 2327-2333.
[10]	肖跃雷, 张云娇. 基于特征选择和超参数优化的恐怖袭击组织预测方法[J]. 计算机应用, 2020, 40(8): 2262-2267.
[11]	汪志远, 降爱莲, 奥斯曼·穆罕默德. 基于正则互表示的无监督特征选择方法[J]. 计算机应用, 2020, 40(7): 1896-1900.
[12]	曹堉, 王成, 王鑫, 高悦尔. 基于时空节点选择和深度学习的城市道路短时交通流预测[J]. 计算机应用, 2020, 40(5): 1488-1493.
[13]	谢琪, 徐旭, 程耕国, 陈和平. 基于新的森林优化算法的特征选择算法[J]. 计算机应用, 2020, 40(5): 1266-1271.
[14]	曾元鹏, 王开军, 林崧. 面向二类区分能力的干扰熵特征选择方法[J]. 计算机应用, 2020, 40(3): 626-630.
[15]	章夏杰, 朱敬华, 陈杨. Spark下的分布式粗糙集属性约简算法[J]. 计算机应用, 2020, 40(2): 518-523.