计算机应用 ›› 2012, Vol. 32 ›› Issue (07): 1978-1982.DOI: 10.3724/SP.J.1087.2012.01978

• 人工智能 • 上一篇    下一篇

基于w-距离均值的模糊聚类算法

张瑞丽,张继福   

  1. 太原科技大学 计算机科学与技术学院,太原030024
  • 收稿日期:2011-12-31 修回日期:2012-03-06 发布日期:2012-07-05 出版日期:2012-07-01
  • 通讯作者: 张瑞丽
  • 作者简介:张瑞丽(1985-),女,山东菏泽人,硕士研究生,主要研究方向:数据挖掘;张继福(1963-),男,山西太原人,教授,博士生导师,博士,主要研究方向:数据挖掘、人工智能。
  • 基金资助:

    山西省自然科学基金资助项目(2010011021-2);山西省回国留学人员科研资助项目(2009-77)

Fuzzy clustering algorithm based on w-mean distance

ZHANG Rui-li,ZHANG Ji-fu   

  1. School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan Shanxi 030024, China
  • Received:2011-12-31 Revised:2012-03-06 Online:2012-07-05 Published:2012-07-01
  • Contact: ZHANG Rui-li

摘要: 针对模糊C-均值(FCM)算法易陷入局部最优值以及对聚类中心和噪声数据敏感问题,提出了一种基于w-距离均值的模糊聚类算法。首先根据数据自身的分布规律,依据样本间距离均值思想确定初始聚类中心,并引入了调衡因子w来调节距离均值阈值;其次为每个样本赋予权值,并利用样本权值修改了聚类中心公式和目标函数公式,提高了算法的抗噪性;最后实验结果验证了所提算法可以有效地解决聚类效果往往受初始聚类中心的影响的问题,避免了局部收敛,增强了抗噪性,准确率和效率较高。

关键词: 模糊聚类, w-距离均值, 初始聚类中心, 调衡因子, 抗噪性

Abstract: In this paper, a fuzzy clustering algorithm based on w-mean distance was proposed to solve such defects of Fuzzy C-Means (FCM) algorithm as easily falling into local optimal value and being sensitive to clustering center and noise data. First, initial clustering centers were determined by making use of the idea of the mean distance according to the distribution of data set, and the regulating factor w was introduced to adjust the mean distance. Second, each sample in data set was assigned a weight, and the clustering center formula and target function formula were modified by the weight, so that the anti-noise performance was greatly improved for the algorithm. In the end, the experimental results validate that the proposed algorithm has good effects on selecting initial clustering centers, avoiding local convergence, and having higher performance of anti-noise and effectiveness.

Key words: fuzzy clustering, w-mean distance, initial clustering center, regulating factor, anti-noise

中图分类号: