计算机应用 ›› 2012, Vol. 32 ›› Issue (06): 1654-1656.DOI: 10.3724/SP.J.1087.2012.01654

• 人工智能 • 上一篇    下一篇

分类数据的聚类边界检测技术

邱保志,王波   

  1. 郑州大学 信息工程学院,郑州 450001
  • 收稿日期:2011-11-24 修回日期:2012-01-20 发布日期:2012-06-04 出版日期:2012-06-01
  • 通讯作者: 王波
  • 作者简介:邱保志(1964-),男,河南驻马店人,教授,博士,主要研究方向:数据挖掘;〓王波(1979-),男,河南安阳人,硕士研究生,主要研究方向:数据挖掘。
  • 基金资助:
    河南省重点科技攻关项目;河南省教育厅自然科学研究计划项目

Cluster boundary detection technology for categorical data

QIU Bao-zhi,WANG Bo   

  1. School of Information Engineering,Zhengzhou University, Zhengzhou Henan 450001,China
  • Received:2011-11-24 Revised:2012-01-20 Online:2012-06-04 Published:2012-06-01
  • Contact: WANG Bo

摘要: 随着分类属性数据集的应用越来越广泛,获取含有分类属性数据集的聚类边界的需求也越来越迫切。为了获取聚类的边界,在定义分类数据的边界度和聚类边界的基础上,提出了一种带分类属性数据的聚类边界检测算法——CBORDER。该算法首先利用随机分配初始聚类中心和边界度对类进行划分并获取记录边界点的证据,然后运用证据积累的思想多次执行该过程来获取聚类的边界。实验结果表明,CBORDER算法能有效地检测出高维分类属性数据集中聚类的边界。

关键词: 边界度, 证据积累, 聚类边界, 分类数据

Abstract: With the wide application of categorical-attribute dataset, the demand of obtaining the cluster boundary of categorical-attribute dataset becomes more and more urgent. In order to get cluster boundaries, the individual proposed a categorical-attribute data boundary detection algorithm: CBORDER(Categorical dataset BORDER detection algorithm) In this algorithm, firstly, initializing the center of cluster by using random allocation and utilizing boundary-degree to partition clusters, at the same time, we get the evidence of capturing boundary recorders. Then, basing on the evidence accumulation, we execute the above procedure repeatedly to acquire the boundaries of clusters at the end. Multi-experimental results demonstrate that CBORDER detect boundaries of the high dimension categorical data efficiently.

Key words: boundary-degree, evidence accumulation, boundary of cluster, categorical dataset

中图分类号: