计算机应用 ›› 2013, Vol. 33 ›› Issue (02): 547-582.DOI: 10.3724/SP.J.1087.2013.00547

• 数据库技术 • 上一篇    下一篇

基于清晰半径的模糊点二次聚类算法

高翠芳,胡权   

  1. 江南大学 理学院,江苏 无锡 214122
  • 收稿日期:2012-08-06 修回日期:2012-08-30 出版日期:2013-02-01 发布日期:2013-02-25
  • 通讯作者: 高翠芳
  • 作者简介:高翠芳(1974-),女,河北石家庄人,讲师,博士,主要研究方向:模式识别、生物信息学;
    胡权(1991-),女,湖南邵阳人,主要研究方向:模式识别。
  • 基金资助:
    中央高校基本科研业务费专项资金资助项目

Second clustering algorithm for fuzzy points based on clear radius

GAO Cuifang,HU Quan   

  1. School of Science, Jiangnan University, Wuxi Jiangsu 214122, China
  • Received:2012-08-06 Revised:2012-08-30 Online:2013-02-01 Published:2013-02-25
  • Contact: GAO Cuifang

摘要: 针对模糊C-均值(FCM)聚类算法在模糊边界上容易出现划分错误的问题,提出一种对模糊点进行二次处理的改进算法。该算法以各类中的数据分布密度为依据,首先利用清晰点构成超球体中心区域,然后基于中心区域的清晰半径定义一种新的相似性距离,并利用该距离对模糊点的隶属度进行二次计算,重新确定其类别归属。实验结果显示,改进算法能有效纠正分类错误,提高模糊点的清晰度,在密度差异较大的数据集上具有一定的应用潜力。

关键词: 模糊聚类, 模糊点, 相似性距离, 中心区域, 二次聚类

Abstract: Concerning the problem of wrong partition at fuzzy boundary in Fuzzy C-Means (FCM) clustering algorithm, an improved recalculation technique for fuzzy points was proposed. The new method took into account the data distribution characteristics in different classes. Firstly, it made the hyperspheres central regions by clear data, then defined a new similarity distance based on the clear radius of central region to recalculate the membership of fuzzy point, and finally reassigned the fuzzy points to right category. The experimental results show that the new algorithm can correct some wrong partition and improve the definition of fuzzy point, and also it is a promising algorithm for dataset with significant density differences.

Key words: fuzzy clustering, fuzzy point, similarity distance, central region, second clustering

中图分类号: