《计算机应用》唯一官方网站 ›› 2025, Vol. 45 ›› Issue (2): 428-435.DOI: 10.11772/j.issn.1001-9081.2024020202

• 数据科学与技术 • 上一篇    

融合衍生特征的时间序列事件分类方法

张翰林, 王俊陆, 宋宝燕()   

  1. 辽宁大学 信息学院,沈阳 110036
  • 收稿日期:2024-02-29 修回日期:2024-05-14 接受日期:2024-05-17 发布日期:2024-06-04 出版日期:2025-02-10
  • 通讯作者: 宋宝燕
  • 作者简介:张翰林(1993—),男,辽宁沈阳人,博士研究生,CCF会员,主要研究方向:时序图查询、机器学习
    王俊陆(1988—),男,辽宁沈阳人,讲师,博士,CCF会员,主要研究方向:深度学习、区块链;
  • 基金资助:
    国家重点研发计划项目(2021YFF0901004);辽宁省应用基础研究计划项目(2022JH2/101300250);辽宁省教育厅高校基本科研项目(理工类)面上项目(揭榜挂帅服务地方项目)(JYTMS20230761)

Time series event classification method fused with derived features

Hanlin ZHANG, Junlu WANG, Baoyan SONG()   

  1. School of Information,Liaoning University,Shenyang Liaoning 110036,China
  • Received:2024-02-29 Revised:2024-05-14 Accepted:2024-05-17 Online:2024-06-04 Published:2025-02-10
  • Contact: Baoyan SONG
  • About author:ZHANG Hanlin, born in 1993, Ph. D. candidate. His research interests include temporal graph query, machine learning.
    WANG Junlu, born in 1988, Ph. D., lecturer. His research interests include deep learning, blockchain.
  • Supported by:
    National Key Research and Development Program of China(2021YFF0901004);Liaoning Province Applied Basic Research Program(2022JH2/101300250);General Project of Educational Department of Liaoning Province’s Higher Education Institution Basic Research Project(Engineering┫ (Leading the Way to Serve Local Projects) ┣JYTMS20230761)

摘要:

时间序列分类是时间序列分析的基础。然而,现有的时间序列分类方法对应的形态特征并不能作为分类依据,且通道间的特征通过图上的单一权重刻画不够准确,导致分类精度不高。因此,提出一种融合衍生特征的时间序列事件分类方法(TSEC-FDF)。首先,在时间序列上构建时间序列事件集合后,根据每个时间序列事件构建突变图、协同图、启发图,以减少噪声对高维特征的干扰;其次,融合多图的特征作为衍生特征,并抽取时间序列事件的多个时间级别的特征;最后,提出一种融合衍生特征的多图卷积分类模型级联时间序列和图特征作为时间序列事件的高维特征。实验结果表明,与TF-C(Time-Frequency Consistency)和BiLSTM+隐马尔可夫模型(Bi-directional Long Short-Term Memory-Hidden Markov Model, BL-HMM)方法相比,TSEC-FDF在4个真实数据集上的准确率、精确率、查全率、F1值、AUROC(Area Under the Receiver Operating Characteristic curve)以及AUPRC(Area Under the Precision versus Recall Curve)至少提升了3.2%、4.7%、7.8%、6.3%、0.9%和2.2%。

关键词: 转换图, 衍生特征, 图卷积神经网络, 多图融合, 时间序列分类, 图构建

Abstract:

Time series classification is the foundation of time series analysis. However, the morphological features corresponding to the existing time series classification methods cannot serve as the basis for classification, and the features between channels are not characterized accurately by the single weight on the graph, resulting in low classification accuracy. Therefore, a Time Series Event Classification method Fused with Derived Features (TSEC-FDF) was proposed. Firstly, after constructing a time series event set on the time series, the mutation graphs, collaborative graphs, and heuristic graphs were constructed on the basis of each time series event to reduce noise interference to high-dimensional features. Secondly, the features of multiple graphs were fused and treated as derived features, and features of time series events at multiple time levels were extracted. Finally, a multi-graph convolutional classification model fusing derived features was proposed, where time series and graph features were cascaded as high-dimensional features of time series events. Experimental results show that TSEC-FDF improves the accuracy, precision, recall, F1 score, AUROC(Area Under the Receiver Operating Characteristic) curve, and AUPRC(Area Under the Precision versus Recall Curve) on 4 real datasets by 3.2%, 4.7%, 7.8%, 6.3%, 0.9%, and 2.2%, at least, compared to TF-C (Time-Frequency Consistency) and Bi-directional Long Short-Term Memory-Hidden Markov Model (BL-HMM) methods.

Key words: transfer graph, derived feature, graph convolutional neural network, multi-graph fusion, time series classification, graph construction

中图分类号: