计算机应用 ›› 2014, Vol. 34 ›› Issue (1): 239-243.DOI: 10.11772/j.issn.1001-9081.2014.01.0239

• 人工智能 • 上一篇    下一篇

基于奇异值分解和判别局部保持投影的多变量时间序列分类

董红玉,陈晓云   

  1. 福州大学 数学与计算机科学学院, 福州 350108
  • 收稿日期:2013-06-03 修回日期:2013-08-06 出版日期:2014-01-01 发布日期:2014-02-14
  • 通讯作者: 陈晓云
  • 作者简介:董红玉(1987-),女,河南安阳人,硕士研究生,主要研究方向:数据挖掘、模式识别;陈晓云(1970-),女,福建晋江人,教授,博士,主要研究方向:模式识别、机器学习。
  • 基金资助:

    国家自然科学基金资助项目;福建省优秀人才支持计划项目

Classification of multivariate time series based on singular value decomposition and discriminant locality preserving projection

DONG Hongyu,CHEN Xiaoyun   

  1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350108, China
  • Received:2013-06-03 Revised:2013-08-06 Online:2014-01-01 Published:2014-02-14
  • Contact: CHEN Xiaoyun

摘要: 针对现有多变量时间序列分类算法存在的要求序列等长和忽视类别信息两个不足,提出基于奇异值分解(SVD)和判别局部保持投影的分类算法。该算法基于降维思想,先通过SVD将样本的第一右奇异向量作为特征向量,以此将不等长序列转化为规模大小相同的序列;接着采用基于最大间距准则的判别局部保持投影对特征向量投影,充分利用类别信息以确保投影后同类样本尽量接近,异类样本尽量分散;最后在低维子空间采用1最近邻(1NN)、Parzen窗、支持向量机(SVM)和朴素Bayes分类器进行分类。在Australian Sign Language(ASL)、Japanese Vowels(JV)和Wafer三个公开的多变量时间序列数据集上进行的实验结果表明:在时间开销基本不变的前提下,所提方法取得了较低的分类错误率。

关键词: 多变量时间序列, 分类, 奇异值分解, 判别局部保持投影, 最大间距准则

Abstract: The existing multivariate time series classification algorithms require sequences of equal length and neglect categories information. In order to solve these defects, a multivariate time series classification algorithm was proposed based on Singular Value Decomposition (SVD) and discriminant locality preserving projection. Based on the idea of dimension reduction, the first right singular vector of samples by SVD was used as feature vector to transform unequal length sequence into a sequence of identical size. Then the feature vector was projected by utilizing discriminant locality preserving projection based on maximum margin criterion, which made full use of categories information to ensure samples of the same class as close as possible and heterogeneous samples as dispersed as possible. Finally, it achieved the classification in a low dimension subspace by using 1 Nearest Neighbor (1NN), Parzen windows, Support Vector Machine (SVM) and Naive Bayes classifier. Experiments were carried out on Australian Sign Language (ASL), Japanese Vowels (JV) and Wafer, the three public multivariate time series datasets. The results show that the proposed algorithm achieves lower classification error rate under the condition of the same time complexity basically.

Key words: multivariate time series, classification, Singular Value Decomposition (SVD), Discriminant Locality Preserving Projection (DLPP), maximum margin criterion

中图分类号: