计算机应用 ›› 2010, Vol. 30 ›› Issue (12): 3380-3384.

• 数据库与数据挖掘 • 上一篇    下一篇

关系数据的中心权重模糊聚类算法

贺杨成1,王士同1,江南2   

  1. 1. 江南大学
    2.
  • 收稿日期:2010-05-14 修回日期:2010-07-22 发布日期:2010-12-22 出版日期:2010-12-01
  • 通讯作者: 贺杨成

Weighted fuzzy clustering algorithm for relational data with multiple medoids

  • Received:2010-05-14 Revised:2010-07-22 Online:2010-12-22 Published:2010-12-01
  • Contact: HE YANGCheng

摘要: k中心点算法仅仅用一个点去代表整个类显然是不足的,这必然会影响聚类结果的准确性。因此提出了一种关系数据的中心权重模糊聚类算法,在该算法中给每一个属于这个类的对象赋予一个中心权重以此来表示其作为这个类的代表对象的可能性程度,这种机制使类中的多个对象来代表整个类而不是利用类中的一个对象来代表整个类。实验结果表明,该算法能更好地发现数据集中潜在的内部结构及对象之间的关系,得到每个聚类结果更加准确的描述。

关键词: 中心权重, 模糊划分, 关系数据, 非相似性

Abstract: It is apparently inadequate that k center algorithm uses only one point to represent the whole class, which will definitely affect the accuracy of clustering results. Therefore, a weighted fuzzy clustering algorithm for relational data with multiple medoids was proposed. In the proposed algorithm, multiple objects in each cluster carried different weights called medoids weights to represent their degrees of representativeness in that cluster. This algorithm can make each cluster to be represented by multiple objects instead of only one object. The experimental results show that the proposed algorithm can capture the underlying structures of the data more accurately and provide richer information for the description of the resulting clusters.

Key words: weighted medoid, fuzzy partition, relational data, dissimilarity