计算机应用 ›› 2017, Vol. 37 ›› Issue (11): 3207-3211.DOI: 10.11772/j.issn.1001-9081.2017.11.3207

• 2017年中国计算机学会人工智能会议(CCFAI 2017) • 上一篇    下一篇

基于复杂学习分类系统的密度聚类方法

黄虹玮1, 葛笑天2, 陈烜松2   

  1. 1. 计算机软件新技术国家重点实验室(南京大学), 南京 210023;
    2. 江苏省审计厅, 南京 210009
  • 收稿日期:2017-05-11 修回日期:2017-07-05 出版日期:2017-11-10 发布日期:2017-11-11
  • 通讯作者: 黄虹玮
  • 作者简介:黄虹玮(1986-),男,陕西汉中人,博士研究生,主要研究方向:在线聚类、学习分类系统;葛笑天(1963-),男,江苏徐州人,高级审计师,硕士,CCF会员,主要研究方向:政务大数据、审计信息化;陈烜松(1971-),男,江苏南京人,高级审计师,硕士,主要研究方向:审计大数据应用、多维数据分析。
  • 基金资助:
    江苏省重点研发计划(产业前瞻与共性关键技术)项目(BE2015213)。

Density clustering method based on complex learning classification system

HUANG Hongwei1, GE Xiaotian2, CHEN Xuansong2   

  1. 1. State Key Laboratory for Novel Software Technology(Nanjing University), Nanjing Jiangsu 210023, China;
    2. Jiangsu Provincial Audit Office, Nanjing Jiangsu 210009, China
  • Received:2017-05-11 Revised:2017-07-05 Online:2017-11-10 Published:2017-11-11
  • Supported by:
    This work is partially supported by the Key Research and Development Program (Industry Forward and Common Key Technology) Project of Jiangsu Province (BE2015213).

摘要: 提出一种基于复杂学习分类系统(XCS)的密度聚类方法,可以用于对任意形状且带有噪声的二维数据进行聚类分析。此方法称为DXCSc,主要包括以下三个过程:1)基于学习分类系统,对输入数据生成规则种群,并对规则进行适当压缩;2)将已经生成的规则视为二维数据点,进而基于密度聚类思想对二维数据点进行聚类;3)对密度聚类后的规则种群进行适当聚合,生成最终的规则种群。在第一个过程中,采用学习分类系统框架生成规则种群并进行适当约减。第二个过程认为种群的各规则簇中心比它们的邻居规则具有更高的密度,并且与密度更高的规则间距离更大。在第三个过程中,采用图分割方法对相关重叠簇进行适当聚合。在实验中,将所提方法与K-means、近邻传播聚类算法(AP)、Voting-XCSc等算法进行了比较,实验结果表明,所提方法在精度方面优于对比算法。

关键词: 学习分类系统, 进化计算, 强化学习, 密度聚类, 规则合并

Abstract: A density clustering method based on eXtended Classifier Systems (XCS) was proposed, which could be used to cluster the two-dimensional data sets with arbitrary shapes and noises. The proposed method was called Density XCS Clustering (DXCSc), which mainly included the following three processes:1) Based on the learning classification system, regular population of input data was generated and compressed. 2) The generated rules were regarded as two-dimensional data points, and then the two-dimensional data points were clustered based on idea of density clustering. 3) The regular population after density clustering was properly aggregated to generate the final regular population. In the first process, the learning classifier system framework was used to generate and compact the regular population. In the second process, the rule cluster centers were characterized by a higher density than their neighbors and by a relatively large distance from points with higher densities. In the third process, the relevant clusters were properly merged using the graph segmentation method. In the experiments, the proposed DXCSc was compared with K-means, Affinity Propagation (AP) and Voting-XCSc on a number of challenging data sets. The experimental results show that the proposed approach outperforms K-means and Voting-XCSc in precision.

Key words: Learning Classifier System (LCS), evolutionary computing, reinforcement learning, density clustering, rule merging

中图分类号: