Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (7): 1869-1882.DOI: 10.11772/j.issn.1001-9081.2019010174

• Artificial intelligence •     Next Articles

Review of clustering algorithms

ZHANG Yonglai, ZHOU Yaojian   

  1. Software School, North University of China, Taiyuan Shanxi 030051, China
  • Received:2019-01-23 Revised:2019-04-09 Online:2019-04-15 Published:2019-07-10
  • Supported by:

    This work is partially supported by the National Natural Science Foundation of China (6160051296).

聚类算法综述

章永来, 周耀鉴   

  1. 中北大学 软件学院, 太原 030051
  • 通讯作者: 周耀鉴
  • 作者简介:章永来(1978-),男,浙江诸暨人,助理教授,博士,主要研究方向:大数据分析与处理、医疗大数据、海洋大数据;周耀鉴(1987-),男,湖北武穴人,助理教授,博士,主要研究方向:大数据分析与处理、海洋大数据、水下机器人。
  • 基金资助:

    国家自然科学基金资助项目(6160051296)。

Abstract:

Clustering is very important as an unsupervised learning algorithm in the age of big data. Recently, considerable progress has been made in the analysis of clustering algorithm. Firstly, the whole process of clustering, similarity measurement, new classification of clustering algorithms and evaluation on their results were summarized. Clustering algorithms were divided into two categories:big data clustering and small data clustering, and the systematic analysis and summary of big data clustering were carried out particularly. Moreover, the research progress and application of various clustering algorithms were summarized and analyzed, and the development trend of clustering algorithms was discussed in combination with the research topics.

Key words: clustering, similarity measurement, big data clustering, small data clustering, clustering evaluation

摘要:

大数据时代,聚类这种无监督学习算法的地位尤为突出。近年来,对聚类算法的研究取得了长足的进步。首先,总结了聚类分析的全过程、相似性度量、聚类算法的新分类及其结果的评价等内容,将聚类算法重新划分为大数据聚类与小数据聚类两个大类,并特别对大数据聚类作了较为系统的分析与总结。此外,概述并分析了各类聚类算法的研究进展及其应用概况,并结合研究课题讨论了算法的发展趋势。

关键词: 聚类, 相似性度量, 大数据聚类, 小数据聚类, 聚类评价

CLC Number: