• •    

CCDM2022+219+判别多维标度特征学习

唐海涛1,王红军1,李天瑞2   

  1. 1. 西南交通大学
    2. 西南交通大学 信息科学与技术学院,成都 610031;
  • 收稿日期:2022-04-01 修回日期:2022-05-16 发布日期:2022-06-29
  • 通讯作者: 王红军
  • 基金资助:
    国家重点研发计划;国家自然科学基金项目

Discriminative multidimensional scaling for feature learning(CCDM2022+219)

  • Received:2022-04-01 Revised:2022-05-16 Online:2022-06-29
  • Supported by:
    National Key Research and Development Program of China;National Natural Science Foundation of China

摘要: 传统多维标度学习得到的低维嵌入保持了数据点的拓扑结构,但忽略了低维嵌入数据类别间的判别性。基于此,提出了判别多维标度模型,在学习低维数据表示的同时发现簇结构,并通过同簇的低维嵌入更加接近,使得学习到的数据表示更加具有判别性。首先,设计了判别多维标度模型所对应的目标公式,目标公式体现了学习的拓扑性和判别性;其次,对目标函数进行了推理和求解,并根据推理过程设计了所对应的算法;最后,采用12公开的数据集和2个评价标准进行对比实验。实验结果表明,所提出的算法的低维嵌入更加具有判别性,在12个数据集上聚类平均准确率相较于原始空间和传统多维标度模型分别提升5.28%和6.21%;平均纯度分别提高1.59%和1.72%。

关键词: 判别性特征学习, 多维标度法, 降维, 模糊聚类, 迭代优化算法

Abstract: Traditional multidimensional scaling method achieves low dimensional embedding, which maintains the topological structure of data points but ignores the discriminability of the low dimensional embedding itself. Based on this, a novel multidimensional scaling model referred to as Discriminative Multidimensional Scaling (DMDS) was proposed to discover the cluster structure while learning the low dimensional data representation. DMDS model can make the low dimensional embedding of the same cluster closer to make the learned data representation be more discriminative. Firstly, a novel objective function corresponding to DMDS model was designed, which maintains the topology and enhances discriminability simultaneously. Secondly, the objective function was reasoned and solved, and an iterative optimization algorithm was proposed according to the reasoning process. Finally, extensive experiments were carried out on twelve open data sets, and two evaluation indices were used to evaluate the experiment results. The experimental results show that the low dimensional embedding learned by DMDS model is more discriminative. On the twelve data sets, the average accuracy of clustering is improved by 5.28% and 6.21% respectively compared with the original data representation and the traditional multidimensional scaling, and the average purity is improved by 1.59% and 1.72% respectively.

Key words: discriminative feature learning, multidimensional scaling, dimensionality reduction, fuzzy clustering, iterative optimization algorithm

中图分类号: