计算机应用 ›› 2012, Vol. 32 ›› Issue (11): 3005-3008.DOI: 10.3724/SP.J.1087.2012.03005

• 人工智能 • 上一篇    下一篇

基于全局性分裂算子的进化K-means算法

王留正,何振峰   

  1. 福州大学 数学与计算机科学学院,福州 350108
  • 收稿日期:2012-05-07 修回日期:2012-06-19 发布日期:2012-11-12 出版日期:2012-11-01
  • 通讯作者: 王留正
  • 作者简介:王留正(1984-),男,河南太康人,硕士研究生,主要研究方向:数据挖掘、机器学习;何振峰(1971-),男,安徽石台人,副教授,博士,主要研究方向:机器学习及其农业应用。
  • 基金资助:
    国家自然科学基金资助项目(60805042);福建省教育厅科技项目(JA11015)

Evolutionary K-means algorithm based on global splitting operator

WANG Liu-zheng,HE Zhen-feng   

  1. School of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350108, China
  • Received:2012-05-07 Revised:2012-06-19 Online:2012-11-12 Published:2012-11-01
  • Contact: WANG Liu-zheng

摘要: 进化算法可以有效地克服Kmeans对初始聚类中心敏感的缺陷,提高了聚类性能。在进化Kmeans聚类算法 (F-EAC)的基础上,针对其变异操作——簇分裂算子的随机性与局部性,提出了两个全局性分裂算子。结合最大最小距离的思想,利用待分裂簇的周边簇信息来指导簇分裂初始点的选择,使簇的分裂更有利于全局划分,以进一步提高进化聚类的有效性。实验结果表明,基于全局性分裂算子的算法在类数发现及聚类精度方面均优于FEAC。

关键词: K-means, 进化算法, 变异算子, 全局分裂, 最大最小距离

Abstract: Evolutionary Algorithm (EA) can effectively overcome the drawback that Kmeans is sensitive to the initial clustering centers, thus enhancing the clustering performance. On the basis of evolutionary Kmeans clustering algorithm (FEAC), considering the randomness and locality in the splitting operator as a mutation operation, two improved splitting operators with global information (global splitting operator) were proposed. The idea of maxmin distance and the information of peripheral clusters were used to guide the selection of the initial splitting centers, in order to make splitting process more beneficial to global partition, furthermore, to improve the efficiency of the evolutionary clustering. The experimental results show that the improved algorithms based on global splitting operator outperform FEAC in terms of clusters number discovering and clustering accuracy.

Key words: Kmeans, Evolutionary Algorithm (EA), mutation operator, global splitting, maxmin distance

中图分类号: