基于自组织映射的流形学习与可视化

doi:10.11772/j.issn.1001-9081.2013.07.1917

计算机应用 ›› 2013, Vol. 33 ›› Issue (07): 1917-1921.DOI: 10.11772/j.issn.1001-9081.2013.07.1917

基于自组织映射的流形学习与可视化

邵超,万春红

河南财经政法大学计算机与信息工程学院,郑州 450002

收稿日期:2013-01-14 修回日期:2013-02-17 出版日期:2013-07-01 发布日期:2013-07-06
通讯作者: 邵超
作者简介:邵超(1977-)，男，河南渑池人，副教授，博士，CCF会员，主要研究方向：人工神经网络、流形学习、数据可视化；万春红(1982-)，女，辽宁抚顺人，讲师，主要研究方向：科技英语、机器翻译。
基金资助:
国家自然科学基金资助项目(61162023);河南省基础与前沿技术研究项目(112300410201);河南省教育厅科学技术研究重点项目基础研究计划(13B520899)

Manifold learning and visualization based on self-organizing map

SHAO Chao,WAN Chunhong

School of Computer and Information Engineering, Henan University of Economics and Law, Zhengzhou Henan 450002, China

Received:2013-01-14 Revised:2013-02-17 Online:2013-07-06 Published:2013-07-01
Contact: SHAO Chao
Supported by:
;the Research Programme of Henan Fundamental and Advanced Technology of China

摘要/Abstract

摘要： 针对自组织映射(SOM)在学习和可视化高维数据内在的低维流形结构时容易产生“拓扑缺陷”的这一问题，提出了一种新的流形学习算法——动态自组织映射(DSOM)。该算法按照数据的邻域结构逐步扩展训练数据集合，对网络进行渐进训练，以避免局部极值，克服“拓扑缺陷”问题；同时，网络规模也随之动态扩展，以降低算法的时间复杂度。实验表明，该算法能更加真实地学习和可视化高维数据内在的低维流形结构；此外，与传统的流形学习算法相比，该算法对邻域大小和噪声也更加鲁棒。所提算法的网络规模和训练数据集合都将按照数据内在的邻域结构进行同步扩展，从而能更加简洁并真实地学习和可视化高维数据内在的低维流形结构。

关键词: 流形学习, 自组织映射, 拓扑缺陷, 局部欧氏性, 邻域结构

Abstract: Self-Organizing Map (SOM) tends to yield the topological defect problem when learning and visualizing the intrinsic low-dimensional manifold structure of high-dimensional data sets. To solve this problem, a manifold learning algorithm, Dynamic Self-Organizing MAP (DSOM), was presented in this paper. In the DSOM, the training data set was expanded gradually according to its neighborhood structure, and thus the map was trained step by step, by which local minima could be avoided and the topological defect problem could be overcome. Meanwhile, the map size was increased dynamically, by which the time cost of the algorithm could be reduced greatly. The experimental results show that DSOM can learn and visualize the intrinsic low-dimensional manifold structure of high-dimensional data sets more faithfully than SOM. In addition, compared with traditional manifold learning algorithms, DSOM can obtain more concise visualization results and be less sensitive to the neighborhood size and the noise, which can also be verified by the experimental results. The innovation of this paper lies in that DSOM expands the map size and the training data set synchronously according to its intrinsic neighborhood structure, by which the intrinsic low-dimensional manifold structure of high-dimensional data sets can be learned and visualized more concisely and faithfully.

Key words: manifold learning, Self-Organizing Map (SOM), topological defect, locally Euclidean nature, neighborhood structure

中图分类号:

TP183

邵超万春红. 基于自组织映射的流形学习与可视化[J]. 计算机应用, 2013, 33(07): 1917-1921.

SHAO Chao WAN Chunhong. Manifold learning and visualization based on self-organizing map[J]. Journal of Computer Applications, 2013, 33(07): 1917-1921.

参考文献

［1］KOHONEN T. Self-organized formation of topologically correct feature maps ［J］. Biological Cybernetics, 1982, 43(1): 59-69.

［2］THALAMUTHU A, MUKHOPADHYAY I, ZHENG X, et al. Evaluation and comparison of gene clustering methods in microarray analysis［J］. Bioinformatics, 2006, 22 (19): 2405-2412.

［3］GHOUILA A, YAHIA S B, MALOUCHE D, et al. Application of Multi-SOM clustering approach to macrophage gene expression analysis［J］. Infection, Genetics and Evolution, 2009, 9(3): 328-336.

［4］王丽敏, 梁艳春, 韩旭明, 等. 多获胜节点SOM及其在股票分析中的应用［J］. 计算机研究与发展, 2008, 45(9): 1493-1500.

［5］SIMILA T. Self-organizing map learning nonlinearly embedded manifolds［J］. Information Visualization, 2005, 4(1): 22-31.

［6］万春红, 邵超. 一种新的基于自组织映射的流形学习算法［J］. 北京交通大学学报, 2009, 33(6): 101-105.
［7］SEUNG H S, LEE D D. The manifold ways of perception［J］. Science, 2000, 290(5500): 2268-2269.

［8］TENENBAUM J B, de SILVA V, LANGFORD J C. A global geometric framework for nonlinear dimensionality reduction ［J］. Science, 2000, 290(5500): 2319-2323.

［9］杨剑, 李伏欣, 王珏. 一种改进的局部切空间排列算法［J］. 软件学报,2005,16(9): 1584-1590.

［10］王耀南, 张莹, 李春生. 基于核矩阵的Isomap增量学习算法研究［J］. 计算机研究与发展, 2009, 46(9): 1515-1522.

［11］ROWEIS S T, SAUL L K. Nonlinear dimensionality reduction by locally linear embedding［J］. Science, 2000, 290(5500): 2323-2326.

［12］ZHANG S. Enhanced supervised locally linear embedding［J］. Pattern Recognition Letters, 2009, 30(13): 1208-1218.

［13］BALASUBRAMANIAN M, SHWARTZ E L, TENENBAUM J B, et al. The ISOMAP algorithm and topological stability［J］. Science, 2002, 295(5552): 7-a.

［14］SAUL L K, ROWEIS S T. Think globally, fit locally: unsupervised learning of low dimensional manifolds［J］. Journal of Machine Learning Research, 2003, 4: 119-155.

［15］詹德川, 周志华. 基于集成的流形学习可视化［J］. 计算机研究与发展, 2005, 42(9): 1533-1537.

［16］曾宪华, 罗四维. 动态增殖流形学习算法［J］. 计算机研究与发展, 2007, 44(9): 1462-1468.

［17］邵超, 黄厚宽, 赵连伟. 一种更具拓扑稳定性的ISOMAP算法［J］. 软件学报, 2007, 18(4): 869-877.

［18］TENENBAUM J B. Mapping a manifold of perceptual observations［C］// Proceedings of the 1997 Conference on Advances in Neural Information Processing Systems. Cambridge: MIT Press, 1998: 682-688.

［19］GUAN H, TURK M. 3D hand pose reconstruction with ISOSOM［C］// Proceedings of the 1st International Symposium on Advances in Visual Computing. Berlin: Springer, 2005: 630-635.

［20］SHI C, ZHANG S, ZHENG Z, et al. Geodesic distance based SOM for image clustering［C］// Proceedings of the 2006 International Conference on Sensing, Computing and Automation. Chongqing: Watam Press, 2006:2483-2488.

［21］OZAKI K, SHIMBO M, KOMACHI M, et al. Using the mutual k-nearest neighbor graphs for semi-supervised classification of natural language data［C］// Proceedings of the 15th Conference on Computational Natural Language Learning. Portland, Oregon: Association for Computational Linguistics, 2011:154-162.

［22］SAXENA A, GUPTA A, MUKERJEE A. Non-linear dimensionality reduction by locally linear Isomaps［C］// Proceedings of the 2004 International Conference on Neural Information Processing. Berlin: Springer, 2004: 1038-1043.

［23］SHAO C, WAN C. Selection of the neighborhood size for manifold learning based on Bayesian information criterion ［J］. Journal of Computational Information Systems, 2012, 8(7): 3043-3050.

[1]	郭羽含, 伊鹏. 长期车辆合乘问题的复合变邻域搜索算法[J]. 计算机应用, 2018, 38(10): 3036-3041.
[2]	范海菊, 刘国奇. 基于核自组织映射的有监督主动轮廓图像分割[J]. 计算机应用, 2016, 36(10): 2832-2836.
[3]	胡彦婷, 王楠楠, 陈建军, 木拉提·哈米提, 阿布都艾尼·库吐鲁克. 基于局部约束邻域嵌入的人脸画像照片合成[J]. 计算机应用, 2015, 35(2): 535-539.
[4]	张成, 刘亚东, 李元. 基于判别式扩散映射分析的非线性特征提取[J]. 计算机应用, 2015, 35(2): 470-475.
[5]	陈达遥陈秀宏. 正交及不相关边界邻域保持嵌入的人脸识别[J]. 计算机应用, 2013, 33(11): 3097-3101.
[6]	刘康钱旭王自强. 基于流形主动学习的遥感图像分类算法[J]. 计算机应用, 2013, 33(02): 326-328.
[7]	石陆魁张军宫晓腾. 基于邻域保持的流形学习算法评价模型[J]. 计算机应用, 2012, 32(09): 2516-2519.
[8]	李冬睿许统德. 自适应邻域选择的数据可分性降维方法[J]. 计算机应用, 2012, 32(08): 2253-2257.
[9]	周雪燕韩建敏詹宇斌. 基于局部平滑性的通用增量流形学习算法[J]. 计算机应用, 2012, 32(06): 1670-1673.
[10]	龚劬华桃桃. 基于改进的局部保持投影算法的人脸识别[J]. 计算机应用, 2012, 32(02): 528-534.
[11]	王铮李兴民. 基于四元数和SOM神经网络的彩色图像边缘检测[J]. 计算机应用, 2012, 32(02): 510-513.
[12]	温金环田铮林伟周敏延伟东. 基于监督局部线性嵌入特征提取的高光谱图像分类[J]. 计算机应用, 2011, 31(03): 715-717.
[13]	李文华. 改进的线性局部切空间排列算法[J]. 计算机应用, 2011, 31(01): 247-249.
[14]	王伟毕笃彦熊磊. 保持全局和局部特性的黎曼流形改进算法[J]. 计算机应用, 2010, 30(12): 3301-3303.
[15]	石陆魁杨庆新. 基于小世界模型的流形学习算法[J]. 计算机应用, 2010, 30(11): 2917-2920.

基于自组织映射的流形学习与可视化

Manifold learning and visualization based on self-organizing map

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics