一种基于密度的高效聚类算法

doi:10.3724/SP.J.1087.2005.01824

计算机应用 ›› 2005, Vol. 25 ›› Issue (08): 1824-1826.DOI: 10.3724/SP.J.1087.2005.01824

一种基于密度的高效聚类算法

石陆魁^1,2，何丕廉¹

1.天津大学计算机科学与技术系； 2.工业大学计算机科学与软件学院

发布日期:2011-04-07 出版日期:2005-08-01
基金资助:
天津市科技发展计划资助项目(04310941R)

Efficient density-based clustering algorithm

SHI Lu-kui^1,2,HE Pi-lian¹

1.Department of Computer Science and Technology,Tianjin University,Tianjin 300072,China; 2.School of Computer Science and Software,Hebei University of Technology,Tianjin 300130,China

Online:2011-04-07 Published:2005-08-01

摘要/Abstract

摘要： 在聚类算法DBSCAN(DensityBasedSpatialClusteringofApplicationswithNoise)的基础上,提出了一种基于密度的高效聚类算法。该算法首先对样本集按某一维排序,然后通过在核心点的邻域外按顺序选择一个未标记的样本点来扩展种子点,以便减少查询次数,降低聚类的时间花费。对样本进行非线性核变换后再进行聚类可以有效地改善聚类的质量。理论分析表明,该算法的时间复杂性接近于线性复杂度。同时测试结果也表明新算法的时间复杂度和聚类质量都显著优于DBSCAN算法。

关键词: 聚类分析, DBSCAN, 核变换

Abstract: An efficient density-based clustering algorithm was presented based on DBSCAN(Density Based Spatial Clustering of Applications with Noise). In this method, objects were sorted by a certain dimensional coordinate at first. Then the new algorithm selected in order unlabelled points outside a core objects neighborhood as seeds to expand clusters so that the execution frequency of region queries could be decreased, and consequently the time cost was diminished. Transforming objects with a non-linear kernel function could effectively improve the clustering accuracy. The theoretic analysis demonstrates that the time complexity of this algorithm is approximately linear. Experimental results also show that the time efficiency and the clustering quality of the new algorithm are greatly superior to those of the original DBSCAN.

Key words: clustering analysis, DBSCAN, kernel transformation

中图分类号:

TP301.6

石陆魁，何丕廉. 一种基于密度的高效聚类算法[J]. 计算机应用, 2005, 25(08): 1824-1826.

SHI Lu-kui,HE Pi-lian. Efficient density-based clustering algorithm[J]. Journal of Computer Applications, 2005, 25(08): 1824-1826.

[1]	戴嫣然, 戴国庆, 袁玉波. 基于肤色学习的多人脸前景抽取方法[J]. 计算机应用, 2021, 41(6): 1659-1666.
[2]	郭佳, 韩李涛, 孙宪龙, 周丽娟. 自动确定聚类中心的比较密度峰值聚类算法[J]. 计算机应用, 2021, 41(3): 738-744.
[3]	陈港, 孟相如, 康巧燕, 阳勇. 基于拓扑分割与聚类分析的虚拟软件定义网络映射算法[J]. 《计算机应用》唯一官方网站, 2021, 41(11): 3309-3318.
[4]	任帅, 徐振超, 王震, 贺媛, 张弢, 苏东旭, 慕德俊. 基于多融合态的低密度三维模型信息隐藏算法[J]. 计算机应用, 2019, 39(4): 1100-1105.
[5]	孙石磊, 王超, 赵元棣. 基于轮廓系数的参数无关空中交通轨迹聚类方法[J]. 计算机应用, 2019, 39(11): 3293-3297.
[6]	王成, 崔紫薇, 杜梓林, 高悦尔. 基于DBSCAN算法和多源数据的缺失公交到站数据修补[J]. 计算机应用, 2019, 39(11): 3184-3190.
[7]	陆明炽, 王守华, 李云柯, 纪元法, 孙希延, 邓桂辉. 基于特征匹配和距离加权的蓝牙定位算法[J]. 计算机应用, 2018, 38(8): 2359-2364.
[8]	任帅, 张弢, 徐振超, 王震, 贺媛, 柳雨农. 特征点标注与聚类的三维模型信息隐藏算法[J]. 计算机应用, 2018, 38(4): 1017-1022.
[9]	李晔, 陈奕延, 张淑芬. 基于密度峰值的混合型数据聚类算法设计[J]. 计算机应用, 2018, 38(2): 483-490.
[10]	徐晓伟, 杜一, 周园春. 基于多源出行数据的居民行为模式分析方法[J]. 计算机应用, 2017, 37(8): 2362-2367.
[11]	梁双, 周丽华, 杨培忠. 基于聚类分析分库策略的社交网络数据库查询性能与数据迁移[J]. 计算机应用, 2017, 37(3): 673-679.
[12]	金亮, 于炯, 杨兴耀, 鲁亮, 王跃飞, 国冰磊, 廖彬. 基于聚类层次模型的视频推荐算法[J]. 计算机应用, 2017, 37(10): 2828-2833.
[13]	谢洪安, 李栋, 苏旸, 杨凯. 基于聚类分析的可信网络管理模型[J]. 计算机应用, 2016, 36(9): 2447-2451.
[14]	王少华, 狄岚, 梁久祯. 基于核与局部信息的多维度模糊聚类图像分割算法[J]. 计算机应用, 2015, 35(11): 3227-3231.
[15]	曹永春蔡正琦邵亚斌. 基于K-means的改进人工蜂群聚类算法[J]. 计算机应用, 2014, 34(1): 204-207.

一种基于密度的高效聚类算法

Efficient density-based clustering algorithm

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics