计算机应用 ›› 2013, Vol. 33 ›› Issue (07): 1917-1921.DOI: 10.11772/j.issn.1001-9081.2013.07.1917

• 人工智能 • 上一篇    下一篇

基于自组织映射的流形学习与可视化

邵超,万春红   

  1. 河南财经政法大学 计算机与信息工程学院,郑州 450002
  • 收稿日期:2013-01-14 修回日期:2013-02-17 出版日期:2013-07-01 发布日期:2013-07-06
  • 通讯作者: 邵超
  • 作者简介:邵超(1977-),男,河南渑池人,副教授,博士,CCF会员,主要研究方向:人工神经网络、流形学习、数据可视化;万春红(1982-),女,辽宁抚顺人,讲师,主要研究方向:科技英语、机器翻译。
  • 基金资助:

    国家自然科学基金资助项目(61162023);河南省基础与前沿技术研究项目(112300410201);河南省教育厅科学技术研究重点项目基础研究计划(13B520899)

Manifold learning and visualization based on self-organizing map

SHAO Chao,WAN Chunhong   

  1. School of Computer and Information Engineering, Henan University of Economics and Law, Zhengzhou Henan 450002, China
  • Received:2013-01-14 Revised:2013-02-17 Online:2013-07-06 Published:2013-07-01
  • Contact: SHAO Chao
  • Supported by:

    ;the Research Programme of Henan Fundamental and Advanced Technology of China

摘要: 针对自组织映射(SOM)在学习和可视化高维数据内在的低维流形结构时容易产生“拓扑缺陷”的这一问题,提出了一种新的流形学习算法——动态自组织映射(DSOM)。该算法按照数据的邻域结构逐步扩展训练数据集合,对网络进行渐进训练,以避免局部极值,克服“拓扑缺陷”问题;同时,网络规模也随之动态扩展,以降低算法的时间复杂度。实验表明,该算法能更加真实地学习和可视化高维数据内在的低维流形结构;此外,与传统的流形学习算法相比,该算法对邻域大小和噪声也更加鲁棒。所提算法的网络规模和训练数据集合都将按照数据内在的邻域结构进行同步扩展,从而能更加简洁并真实地学习和可视化高维数据内在的低维流形结构。

关键词: 流形学习, 自组织映射, 拓扑缺陷, 局部欧氏性, 邻域结构

Abstract: Self-Organizing Map (SOM) tends to yield the topological defect problem when learning and visualizing the intrinsic low-dimensional manifold structure of high-dimensional data sets. To solve this problem, a manifold learning algorithm, Dynamic Self-Organizing MAP (DSOM), was presented in this paper. In the DSOM, the training data set was expanded gradually according to its neighborhood structure, and thus the map was trained step by step, by which local minima could be avoided and the topological defect problem could be overcome. Meanwhile, the map size was increased dynamically, by which the time cost of the algorithm could be reduced greatly. The experimental results show that DSOM can learn and visualize the intrinsic low-dimensional manifold structure of high-dimensional data sets more faithfully than SOM. In addition, compared with traditional manifold learning algorithms, DSOM can obtain more concise visualization results and be less sensitive to the neighborhood size and the noise, which can also be verified by the experimental results. The innovation of this paper lies in that DSOM expands the map size and the training data set synchronously according to its intrinsic neighborhood structure, by which the intrinsic low-dimensional manifold structure of high-dimensional data sets can be learned and visualized more concisely and faithfully.

Key words: manifold learning, Self-Organizing Map (SOM), topological defect, locally Euclidean nature, neighborhood structure

中图分类号: