• •    

一种快速识别密度骨架的聚类算法

邱保志,唐雅敏   

  1. 郑州大学 信息工程学院
  • 收稿日期:2017-06-01 修回日期:2017-07-25 发布日期:2017-07-25
  • 通讯作者: 唐雅敏

Efficient clustering based on density backbone

  • Received:2017-06-01 Revised:2017-07-25 Online:2017-07-25
  • Contact: Ya-Min TANG

摘要: 为了快速寻找密度骨架和提高对高维数据聚类结果的准确性,提出了一种新的基于k近邻的局部密度计算方法和快速识别高密度骨架的聚类算法ECLUB(Efficient CLUstering based on density Backbone)。算法首先根据互k近邻一致性及近邻点局部密度关系,快速识别出高密度骨架,其次未分配的低密度点依据邻近关系进行划分,得到最终聚类。人工合成数据集及真实数据集的实验验证了算法的有效性,与同类算法相比,ECLUB算法具有高效、对高维数据聚类准确率高的优势。

关键词: 聚类算法, 高维数据, k近邻, 密度骨架, 局部密度

Abstract: In order to find density backbone quickly and improve the accuracy of the high-dimensional data clustering results, a new algorithm based on the method of local density computation and fast recognition of high-density backbone is put forward, which is called ECLUB. Firstly, the high density backbone is identified quickly according to the mutual consistency of k neighbors and the local density relation of neighbor points. Then the unassigned low-density points are divided according to neighborhood relations to obtain the final clustering. Experiments on synthetic datasets and real datasets show the effectiveness of the proposed algorithm. The ECLUB algorithm has the advantages of high efficiency and high accuracy for high-dimensional data clustering compared with the similar algorithms.

Key words: clustering algorithm, high dimension data, k nearest neighbors, density backbone, local density

中图分类号: